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issues  studied  in  this  dissertation.  Stochastic  operator  theory  and 
results  from  functional  analysis  allow  these  problems  to  be  solved 
for  a  wide  variety  of  random  scattering  media.  The  models  are  based 
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The  numerical  issues  involved  with  implementat ion  are  studied  in 
detail.  We  show  how  the  processor  equations  can  be  solved  using  robust, 
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between  the  Karhunen-Loeve  basis  and  canonical  matrix  decompositions 
are  established. 

In  summary,  ideas  from  system  modeling,  identification,  detection 
and  estimation  theory,  and  numerical  analysis  are  combined  in  order  to 
implement  optimal  array  processors  for  signal  detection  and  estimation 
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Chapter  1 


INTRODUCTION 


1 . 1  Problem  Statement  and  Its  Importance 

1.1.1  General  Problem  Statement 

This  dissertation  studies  several  issues  arising  in  implementing 
adaptive  array  processors  for  signal  detection  and  estimation.  We  shall 
focus  our  attention  on  detecting  signals  that  have  propagated  through 
randomly  time-varying  scattering  media.  The  stochastic  nature  of  the 
medium  causes  the  returned  signal  energy  to  be  a  random  process  spread 
with  respect  to  range,  angle,  and  Doppler.  These  effects  will  be 
modeled  by  linear,  bounded  stochastic  operators  acting  on  the  trans¬ 
mitted  signal.  An  operator  theoretic  approach  to  the  detection  problem 
provides  the  means  to  solve  for  the  optimal  processor  for  a  wide  class 
of  propagation  and  scattering  channels. 

One  can  show  that  the  optimal  array  processor  structure  can  be 
implemented  in  an  "estimator-correlator"  structure  (Figure  1-1)  [1]  [2], 
In  other  words,  the  array  measurements  are  directed  into  two  branches, 
each  of  which  is  described  by  a  matrix  filter  acting  on  the  data.  The 
matrix  filters  _£(  • ,  • )  and  G( • , • )  are  found  by  solving  the  following 
integral  equations: 


/  — m^c’  u)_2(u,  z)  =  6(t  -  z)  _I 
T 


/  R.(t,  u)  G(u,  z)  du  =  R  (t,  z) 
i,  —  1  —  —  y 

T  } 


(1.1-1) 


(1.1-2) 


«  n"**  «l 


where  Rjj(*,*)  is  the  covariance  kernel  of  the  measurement  noise,  jly(*,*) 
is  the  covariance  kernel  of  the  returned  signal  alone,  RjC*,*)  is  the 
covariance  kernel  of  combined  returned  signal  plus  noise,  T  is  the 
observation  interval,  and  _I  is  the  identity  matrix.  The  lower  branch 
containing  •,• )  will  be  called  the  estimator  branch,  because  it  calcu¬ 
lates  an  optimal  estimate  of  the  backscattered  signal  based  on  the  array 
data  _r( • , • ) .  The  upper  branch  will  be  referred  to  as  the  inverse  filter 
branch,  because  it  represents  the  inverse  of  The  outputs  of 

each  branch  are  correlated  to  form  the  scalar-valued  likelihood  ratio  i. 

In  principle,  there  is  no  reason  why  the  optimal  detector  could  not 
be  implemented  after  solving  these  equations.  However,  several 
problems  will  become  apparent  as  we  consider  in  detail  how  to 
implement  this  processor  in  a  practical  working  environment.  Solving 
the  equations  is  so  difficult  that  suboptimal  schemes  are  almost  always 
used. 


1.1.2  Specific  Problem  Statement 

The  specific  issues  which  shall  be  studied  in  this  dissertation  can 
be  summarized  as  follows: 

(1)  To  study  the  relationships  among  detection  theory,  estimation 
theory,  stochastic  system  modeling,  and  system  identification 
within  the  context  of  the  estimator-correlator  processor. 

(2)  To  establish  the  connections  among  Karhunen-Loeve  expansions, 
system  models,  generalized  Fourier  series,  canonical  matrix 
decompositions,  and  generalized  eigenvalue  matrix  decompositions. 

(3)  To  exploit  the  preceding  results  in  order  to  implement  the 
estimator-correlator  structure  with  state-of-the-art  computational 
algorithms . 
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1.1.3  Importance  of  the  Problem 

Most  adaptive  array  processor  designs  are  based  on  oversimplified 
descriptions  of  the  backscattering  object  and  transmission  medium.  They 
are  called  point  scatterer  models,  and  although  they  assume  random 
amplitude  and  phase,  their  essential  features  are  deterministic.  To  be 
more  precise,  the  backscattered  signal  is  represented  as  a  time-  and 
Doppler-shifted  version  of  the  original  signal  which  arrives  at  the 
array  from  one  specific  direction.  The  return  can  be  interpreted  as  a 
point  in  a  three-dimensional  space  parameterized  by  time  delay,  Doppler 
shift,  and  arrival  angle.  Implementing  these  processors  is 
straight  forward . 

However,  a  deterministic  treatment  of  transmission  channel  effects 
is  rarely  an  adequate  representation  of  reality.  Since  many  uncon¬ 
trolled  factors  determine  real  wave  motion,  stochastic  descriptions  are 
usually  more  appropriate.  Stochastic  media  cause  the  returning  signal 
energy  to  be  distributed,  or  spread,  over  random  intervals  of  time  delay 
(range),  Doppler,  and  arrival  angle.  One  can  tf ink  of  the  returned 
energy  as  distributed  over  a  volume  in  the  previously  defined  parameter 
space.  The  total  signal  energy  available  for  processing  is  recovered  bv 
integrating  over  the  entire  volume. 

This  discussion  leads  to  the  importance  of  this  work.  When  a 
processor  designed  for  point  channels  is  employed  in  a  stochastic 
environment,  only  a  fraction  of  the  returned  signal  energy  is  actually 
processed.  As  a  result,  the  overall  performance  of  the  detector  is 


significantly  reduced. 


Total  signal  energy  processed  determines  array  processor  perform¬ 
ance.  In  order  to  maximize  performance,  all  available  signal  energy 
must  be  used.  This  is  why  optimal  structures  must  be  implemented, 
especially  in  low  signal-to-noise  ratio  environments.  The  preceding 
comments  provide  the  underlying  motivation  for  the  work  presented  in 
this  dissertation. 

1 . 2  Problem  Formulation 

A  schematic  diagram  of  the  working  environment  is  shown  in 
Figure  1-2.  A  signal  s(*)  is  transmitted  into  the  medium  over  a  finite 
time  interval.  s(*)  shall  be  called  the  probing  signal,  and  we  will 
assume  its  characteristics  are  known  and  subject  to  our  control.  If  a 
reflecting  object  is  present  in  the  medium,  then  energy  from  the  probing 
signal  scatters  off  the  object  in  different  directions.  A  collection  of 
sensors  called  the  array  is  immersed  in  the  medium  and  is  designed  to  be 
sensitive  to  the  signal  energy  reflected  from  the  object. 

All  signals  not  related  to  the  backscattered  probing  signal 
represent  undesired  interference,  and  will  be  called  noise.  It  has  two 
components:  system  noise  which  arises  within  the  processor,  and 

ambient  noise  which  enters  the  system  through  the  array.  The  ambient 
noise  is  spatiallv  distributed,  but  not  necessarily  isotropically.  We 
shall  combine  the  internal  and  external  noises  into  a  single  process 
modeled  by  a  zero-mean,  Gaussian  distributed  random  vector. 

The  total  output  of  the  arrav  is  measured  over  a  tine  interval  T 
and  a  spatial  aperture  A.  G*-r  objective  is  to  design  a  processing 
system  which  takes  the  arrav  data  and  decides  if  backscattered  signal 
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energy  is  absent  or  present  in  the  measurement,  in  other  words,  if  the 
object  is  absent  or  present. 

This  problem  can  be  formulated  in  mathematical  terms  as  follows. 

The  array  output  shall  be  denoted  by  r(*,‘).  It  is  a  complex-valued 
vector  stochastic  process,  and  the  i-th  element  of  ri t ,  w)  represents 
the  measured  value  of  the  i-th  sensor  in  the  array  at  time  t. 

The  background  and  measurement  noise  entering  the  processing  svstem 
will  be  denoted  by  nC*,*).  n( •  ,  •  )  is  a  zero-mean,  Gaussian  distributed 
stochastic  process. 

The  energv  backscattered  off  the  reflecting  object  will  he  denoted 
by  £s(*),  or  by  v(*,*).  This  process  is  inherently  random  even  though 
s  C  * )  is  known;  therefore,  it  must  be  modeled  as  a  stochastic  process  as 
well.  £( •  )  represents  the  effects  of  the  scattering  object  and  propa¬ 
gation  channel.  We  shall  refer  to  them  genericallv  as  "the  channel," 
even  though  many  transformations  mav  actually  have  taken  place.  i(’)  is 
much  more  than  a  convenient  shorthand  notation.  It  will  be  discussed 
at  length  later. 

Assuming  the  medium  and  arrav  are  linear,  when  backscattered  energy 
is  present  in  the  measurement : 

_r  ( t  ,  w  )  -  f  s_(  t  )  +  n  ( t  ,  m  ) 

When  the  object  is  absent: 


In  practice,  we  do  not  know  which  form  of  r(*,*)  is  correct.  The 
process  of  deciding  between  these  alternatives  in  a  statistically 
meaningful  sense  is  called  signal  detection.  Implementing  the  structure 
which  solves  the  detection  problem  is  the  basic  issue  addressed  in  this 
dissertation. 

1 . 3  Basic  Approach  to  *~he  Problem 

Let  us  begin  by  describing  the  measurement  model  in  greater  detail, 
since  it  represents  a  central  part  of  the  overall  approach  to  the  pro¬ 
blem.  We  stated  that  £(•)  represents  the  effects  of  the  scattering 
channel;  however,  we  have  yet  to  define  £(•)  in  mathematical  terms,  or 
to  motivate  its  usefulness  in  this  context. 

£(•)  is  a  bounded,  linear,  stochastic  operator  which  will  represent 
the  overall  effects  of  the  medium.  This  approach  is  nor  new.  It  was 
suggested  bv  Middleton  thirtv  vears  ago  [3]  and  reintroduced  more 
recently  bv  Sohie  [4].  However,  we  are  the  first  to  applv  stochastic 
operator  theorv  systematical lv  to  the  channel  modeling  and 
Identification  problem. 

A  stochastic  operator  theoretic  approach  is  useful  for  several  rea¬ 
sons.  First  of  all,  it  is  clear  that  deterministic  representations  of 
real  scattering  media  are  totally  inadequate.  Stochastic  transforma¬ 
tions  must  be  used  in  order  to  develop  realistic  models  for  the  channel 
effects,  and  stochastic  operator  theorv  provides  the  means  to  do  so.  In 
this  dissertation,  the  stochastic  operator  modeling  problem  will  be 
studied  in  some  detail.  Also,  although  an  operator  theoretic  approach 
to  the  problem  is  actually  quite  abstract,  it  allows  the  detection 
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problem  to  be  solved  generically  for  a  wide  class  of  stochastic  media, 
bypassing  a  detailed  mathematical  analysis  of  many  special  cases. 


Moreover,  this  allows  theorems  and  ideas  from  functional  analysis  and 
Hilbert  space  theory  to  be  used  to  obtain  new  results  and  new  insights 
into  the  modeling  issues. 

Now  that  the  nature  of  £(•)  has  been  described  in  detail,  let  us 
turn  our  attention  to  solving  the  detection  problem.  We  call  upon  the 
array  processing  system  to  make  a  judgement  concerning  the  nature  of 
r(*,*).  This  problem  is  distinguished  by  the  facts  that  the  processor 
has  only  limited  knowledge  of  the  backseat tered  signal,  and  that  random 
noise  always  obscures  the  signal  to  a  varying  degree.  Hence,  it  is 
logical  to  conclude  that  the  required  judgement  must  be  a  statistical 
inference  based  on  results  from  statistical  decision  theory. 

The  signal  detection  problem  is  equivalent  to  a  statistical  hypo¬ 
thesis  testing  problem,  in  which  the  hypothesis  that  noise  alone  is 
present  is  to  be  tested  against  tne  hypothesis  that  signal  and  noise  are 
present.  These  alternatives  are  expressed  in  statistical  terms  bv : 


H  i  :  _r ( t ,  id  )  =  ££(  t  )  +  n(  t ,  w ) 
Hn:  r(  t ,  u )  ■  n(  t ,  w  ) 


W 


\>  *>  V  *v 

v  vvV 


/Sv  v . 

y'-w 
-  \  •% 
/  .» 

•vw 

•  •V 

/.  .V  .  / 


.V./.V.VJ 


where  H|  and  Ho  are  abbreviations  for  hypothesis  one  and  the  null  hypo¬ 
thesis  respectively. 

The  solution  to  the  detection  problem  is  well  known  and  can  be 
found  in  anv  one  of  several  references  (1)  |1J.  The  optimal  test,  or 

optimal  processor,  is  prescribed  bv  calculating  a  real-valued  statistic 


.  A W *• 

•  V  v  V..- 


ja  *  i%jSJ  * 


V/.V; 


w  v  ■ 
'■.Vvv 

v  «V  \  .%'s' '.'.1 
■  V.  •  ./,  .  „  A  v,  , 


of  r(*,*)  called  the  likelihood  ratio,  and  comparing  this  result  to  a 


predetermined  number  y.  If  the  likelihood  ratio  exceeds  this  threshold, 
we  assert  that  Hj  is  true;  otherwise,  we  assert  that  the  null  hypothesis 
is  correct. 

The  processor  can  be  implemented  using  several  equivalent  struc¬ 
tures.  We  have  already  described  the  "estimator-correlator"  realiza¬ 
tion,  in  which  the  non-causal  conditional  mean  estimate  of  £s(*)  is 
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correlated  with  a  filtered  version  of  the  data  _r ( •  ,  •  )  tc  ohtain  the 
likelihood  ratio.  This  is  the  basic  structure  that  will  be  studied  in 
this  dissertation. 

There  are  several  reasons  why  we  have  chosen  the  estimator- 
correlator  structure.  First,  it  establishes  a  basic  connection  between 
detection  theory  and  estimation  theory.  This  can  be  seen  by  comparing 
this  structure  with  the  structure  solving  the  known  signal  in  Gaussian 
noise  detection  problem  (Figu  e  1-3).  Clearly,  the  two  are  similar. 

The  estimator-correlator  treats  the  conditional  mean  estimate  as  if  it 
were  deterministic  in  the  subsequent  correlation  operation. 

While  the  preceding  discussion  points  out  a  theoretically  elegant 
connection  between  the  es t ima tor-cor re  la  tor  and  other  processors,  there 
ire  more  fundamental  reasons  why  an  estimator-correlator  realization  is 
useful.  Thev  can  be  seen  by  examining  the  nature  of  the  solution  in 
greater  detail. 
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Solving  Equations  (1.1-1)  and  (1.1-2)  for  £(•,*)  and  _0( • , • )  pre¬ 


supposes  perfect  a  priori  knowledge  of  covariance  kernels  Jl}j(*,*), 
_Ry(*,*),  and  jtj(*,*).  However,  in  problems  of  practical  interest,  this 
information  is  seldom  available  in  advance.  This  clearly  suggests 
attempting  to  implement  the  processor  using  adaptive  signal  processing 
techniques;  indeed,  adaptation  is  the  key  to  optimum  receiver 
implementation  [5] .  The  estimator-correlator  structure  is  particularly 
useful  from  this  point  of  view. 

We  will  show  that  R>j(*,*)  and  _Ri(*,*)  can  be  estimated  directly 
from  array  data.  On  the  other  hand,  _Ry(*,*)  can  not  be  obtained 
directlv,  since  the  return  £s^ • )  is  always  obscured  by  background  and 
measurement  noise.  If  a  priori  knowledge  of_Ry(*,*)  is  unknown  cr 
incomplete,  another  means  must  be  found  to  estimate  its  salient  features 


in  conjunction  with  the  detection  process. 

This  aspect  of  the  problem  will  be  approached  from  a  system 
modeling  and  identification  point  of  view.  The  stochastic  transmission 
media,  represented  generically  by  linear  stochastic  operator  £(•),  can 
be  interpreted  as  an  unknown  linear  system.  By  exploiting  results  from 
functional  analysis  and  stochastic  operator  theory,  relationships 
between  _Ry  ( * , * )  and  £(•)  suitable  for  digital  signal  processing 
applications  can  be  derived.  In  particular,  the  formulations  are  based 
on  the  matrix  representation  of  £(•)  and  the  spectral  representation  of 
jl y (*,*).  Integrating  these  results  into  the  estimator-correlator 
structure  is  straightforward  and  establishes  a  fundamental  connection 
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among  detection  theory,  estimation  theory,  system  modeling,  and  system 
identification  theory. 

In  this  dissertation,  we  will  also  study  the  implementation  problem 
in  some  detail.  The  theoretical  analysis  and  practical  implementation 
are  both  based  on  Karhunen-Loeve  expansions  of  the  received  data.  Of 
course,  the  use  of  series  expansions  as  a  theoretical  tool  in  such 
disciplines  as  detection  theory  and  estimation  theory  is  well  estab¬ 
lished,  and  they  provide  a  great  deal  of  insight  into  the  nature  of  the 
optimal  processor.  However,  we  assert  that  they  are  the  key  to 
implementation  as  well.  Going  bevond  formal  manipulation  of  infinite 
series,  we  show  how  these  representations  lead  to  structures  which 
can  be  implemented  with  state-of-the-art  computational  algorithms. 

The  results  establish  relationships  among  Karhunen-Loeve  expansions, 
matrix  decompositions  such  as  singular  value  decompositions  and  OR 
factorizations,  and  generalized  eigenvalue  problems. 

In  conclusion,  our  approach  to  the  problem  is  based  on  the 
estimator-correlator  canonical  structure.  Within  this  framework,  the 
interrelations  among  system  modeling,  stochastic  operator  theory, 
identification,  and  orthonormal  expansions  will  be  established.  The 
solutions  to  the  implementation  problem  are  based  on  Karhunen-Loeve 
series  expansions.  They  lead  to  structures  that  can  be  realized  through 
robust  numerical  algorithms. 


1.4  Review  of  Previous  Work 


The  estimator-correlator  structure  was  first  studied  by  Price  [6] 
in  the  context  of  the  Gaussian  signal  in  Gaussian  noise  detec¬ 
tion  problem.  He  noticed  that  the  solution  could  be  interpreted  as  a 
modification  of  the  standard  "correlation”  receiver  which  solves  the 
known  signal  in  Gaussian  noise  detection  problem.  The  modification  is 
elegant  and  intuitive.  Specifically,  the  optimal  estimate  of  the  signal 
process  is  used  as  if  it  were  deterministic  in  the  subsequent 
correlation  operation. 

Later,  Kailath  [7]  and  Esposito  [8]  studied  this  structure  in 
more  detail.  Both  argued  that  it  should  be  optimum  or  close  to  optimum 
for  detecting  random  signals  (not  necessarily  Gaussian)  in  additive 
Gaussian  noise.  However,  their  results  must  be  interpreted  carefully. 
For  example,  although  Esposito  was  able  to  show  that  an  "estimator- 
correlator"  structure  exists  for  a  broad  class  of  random  signals,  it  is 
not  possible  to  interpret  the  signal  estimate  as  "optimal"  in  a 
meaningful  sense  except  for  Gaussian  signals.  Moreover,  since  the 
signal  process  estimates  can  not  be  determined  uniquely,  they  actually 
have  little  intrinsic  value  except  in  the  context  of  the  receiver 
structure  itself. 

A  few  years  later,  Kailath  returned  to  the  problem  of  detecting 
non-Gaussian  stochastic  signals  v(*,*)  in  additive  Gaussian  noise  [9], 
and  proved  that  the  log-likelihood  functional  has  the  form: 

£(r)  =  /  r  ( t ,  w)  y(t,  w)  dt  -  -j  f  |y(t,  io)|2  dt  (1.4-1) 
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where  y(*,*)  is  the  causal  conditional  mean  estimate  of  y(*,*)  given 
r(*,*),  assuming  that  Hj  is  true.  Equation  (1.4-1).  shows  that  an 
estimator-correlator  interpretation  of  detection  can  be  generalized  to 
encompass  a  broader  class  of  signals.  However,  the  “correlation" 
integral  has  to  be  defined  as  an  Ito  stochastic  integral,  making 
practical  implementation  of  (1.4-1)  problematical.  Furthermore,  Kailath 
pointed  out  that  this  result  was  not  merely  a  question  of  rigor. 

Other  definitions  for  the  correlation  integral  yield  detector  structures 
inconsistent  with  previously  obtained  results. 

More  recently,  Schwartz  extended  these  ideas  to  discrete  time 
problems  [10],  and  in  addition,  demonstrated  that  the  structure  is 
optimal  when  the  data  come  from  generalized  exponential  distributions,  a 
broad  and  important  class  of  distributions.  During  this  research, 
the  author  generalized  his  results  to  the  vector  measurement  case. 

Once  again,  a  conditional  mean  estimate  of  the  signal  is  the  central 
part  of  the  structure,  and  it  appears  in  a  correlation  integral  used  to 
calculate  the  likelihood  ratio.  However,  evaluating  the  likelihood 
ratio  is  quite  difficult,  and  as  Schwartz  himself  noted,  implementing 
this  generalized  estimator-correlator  structure  would  not  be  a  simple 
task . 

Actually,  the  importance  of  these  results  relates  to  the  insights 
they  give  into  optimal  receiver  structures.  They  show  that  the 
estimator-correlator  interpretation  of  detection  can  be  generalized  to 
many  different  of  signal  and  noise  models.  In  other  words,  much  of  the 


intuition  provided  by  an  estimator-correlator  interpretation  of  the 
solution  to  the  Gaussian  signal  in  Gaussian  noise  detection  problem 
carries  over  to  more  general  situations. 

The  optimum  and  adaptive  array  processing  problems  have  received  a 
great  deal  of  attention  for  nearly  three  decades,  and  consequently, 
there  is  an  abundance  of  literature  in  these  areas.  It  is  beyond  the 
scope  of  this  dissertation  to  present  a  detailed  review  of  the  work 
already  accomplished  in  these  fields;  however,  we  shall  briefly  point 
out  several  references  of  special  interest. 

Van  Trees  studied  optimal  array  processing  techniques  in  a  classic 
report  published  two  decades  ago  [2].  In  particular,  he  examined  the 
Gaussian  signal  in  Gaussian  noise  detection  problem,  and  derived  several 
equivalent  forms  of  the  optimal  structure,  including  the  estimator- 
correlator.  Other  landmark  papers  were  written  by  McDonough  [11],  Bryn 
[12],  Edelblute  and  his  colleagues  [13],  and  Cox  [14],  Adaptive  array 
processing  has  been  discussed  in  Monzingo  and  Miller  [15],  Haykin  [16], 
Hudson  [17],  and  Widrow  and  Stearns  [18].  An  extensive  bibliography  of 
current  work  in  this  area  is  contained  in  a  new  book  edited  bv  Sibul 
[19]. 

1 . 5  Overview  of  the  Dissertation 

Chapter  2  introduces  the  concepts,  definitions,  notation,  and 
theorems  that  will  be  used  in  subsequent  chapters.  In  particular,  the 
Karhunen-Loeve  expansion  is  defined,  and  a  method  of  calculating  its 
basis  from  an  arbitrary  orthonormal  basis  is  introduced. 

In  Chapter  3,  the  channel  modeling  problem  is  studied.  Our 
approach  is  based  on  matrix  representations  of  bounded,  linear  opera- 


tors.  The  results  have  interesting  implications  in  several  disciplines, 
including  the  theory  of  non-stationary  stochastic  processes. 

Solutions  for  _0( • , • )  and  are  derived  in  Chapter  4.  Spectral 

representations  for  _Rjj(  •  ,  • ) ,  Ry(*,*),  and  Rj(*,*)  are  used  to  solve  the 
processor  equations.  Indeed,  we  will  argue  that  this  approach  is  not 
only  a  useful  theoretical  tool,  but  is  also  the  key  to  adaptive 
implementation.  The  relationship  between  £(•)  and  _Ry(*,*)  is  estab¬ 
lished  and  incorporated  into  the  estimator-correlator  structure. 

It  turns  out  that  the  relationship  between  £(•)  and  Ry(',*)  which 
is  developed  in  Chapter  4  is  not  particularly  convenient  from  an 
implementation  point  of  view.  Therefore,  in  Chapter  5,  the  stochastic 
identification  problem  is  examined  in  further  detail.  Simultaneous 
diagonalization  of  the  input  and  output  covariance  kernels  is  used  to 
obtain  a  simplified  relationship. 

The  numerical  issues  associated  with  adaptive  implementation  are 
studied  in  Chapter  6.  Our  results  establish  several  interesting 
connections  among  canonical  matrix  decompositions  and  the  Karhunen- 
Loeve  expansions  of  _£(•,•)  and  Q( • , • ) .  In  addition,  the  CS  decompo¬ 
sition  is  Introduced  as  a  numerically  stable  method  for  obtaining  G( •  ,  • ) 
from  array  measurements. 

In  Chapter  7,  numerical  results  are  presented  and  evaluated.  We 
consider  an  example  of  considerable  practical  interest;  namely,  detec¬ 
tion  in  multipath  propagation  channels.  Finally,  conclusions  and 
recommendations  for  further  research  are  given  in  Chapter  8. 
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Chapter  2 


MATHEMATICAL  BACKGROUND 

2 . 1  Introduction  and  Overview 

This  chapter  introduces  the  concepts,  definitions,  notation,  and 
theorems  that  will  be  used  in  subsequent  chapters.  Results  will  be 
stated,  hut  proofs  and  derivations  will  be  omitted.  The  interested 
reader  can  find  careful  developments  of  Hilbert  space  theory  and  linear 
operator  theorv  in  one  of  several  standard  texts  [20]  [21]  [22]. 

We  begin  by  establishing  the  mathematical  structure  in  which  sig¬ 
nals  are  represented  as  elements  in  separable  Hilbert  spaces.  This 
framework  allows  us  to  represent  signal  processing  operations  as 
bounded,  linear  operators  defined  over  a  Hilbert  space  of  interest. 

Next,  the  problem  of  representing  deterministic  and  stochastic  signals 
is  examined.  We  introduce  generalized  Fourier  series  expansions  for 
both  deterministic  and  stochastic  signals,  and  the  Karhur.en-Loeve  expan- 
sion  is  defined.  A  new  method  of  calculating  the  Karhunen-Loeve  basis 
is  introduced.  Finallv,  we  discuss  deterministic  and  stochastic  opera¬ 
tors,  and  demonstrate  their  usefulness  in  the  context  of  this  work. 

2 . 2  Hilbert  Spaces 
2.2.1  Definition 

A  Hilbert  space  is  defined  as  a  complete  inner  product  space  [20]. 
An  inner  product  space  is  a  linear  vector  space  endowed  with  a 
functional  which  maps  the  product  space  H  x  H  onto  the  set  of  complex 
scalars.  The  functional  is  called  the  inner  product,  and  will  be 
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denoted  by  <  *,*  >.  This  inner  product  induces  a  norm  onto  H  which  is 
Riven  bv  the  definition 

I | x | |  =  <  x,x  >1/2  (2.2-1  ) 


for  all  x  in  H. 

Inner  product  spaces  are  complete  if  every  Cauchy  sequence  con¬ 
venes  to  a  point  in  the  space. 

Only  separable  Hilbert  spaces  shall  be  studied  in  this  disserta¬ 
tion.  A  Hilbert  space  is  separable  only  if  an  orthonormal  basis  exists 
in  the  space. 

2.2.2  Examples 

The  set  of  all  N-tuples  or  complex  scalars  is  a  Hilbert  space  with 
inner  product 


<  x,v  > 


N 

L 

1  =  1 


(2.2-2) 


for  all  x  and  y  in  H.  This  elementarv  Hilbert  space  is  very  useful  in 
signal  processing  applications  for  representing  finite-dimensional 
deterministic  signals. 

The  space  of  all  square-integrable  functions  defined  on  an  inter¬ 
val  (a,  b)  of  the  real  line  can  be  shown  to  be  a  separable  Hilbert  space 
[20],  In  signal  processing,  this  could  represent  all  possible  finite- 
energy  waveforms  received  over  a  finite  time  interval  T  or  spatial 
aperture  A.  In  these  cases,  a  suitable  inner  product  is 


The  norm  fjxU  represents  the  energv  of  a  signal  x. 

If  a  received  signal  is  more  appropriately  represented  as  a  sto¬ 
chastic  process,  a  useful  inner  product  definition  is: 

<x,v  >  =  t  E  {  x  ( t  ,  ui )  v  *  (  t ,  uj ) }  dt  (2,2-4) 

T 

Various  combinations  of  purelv  temporal,  purely  spatial,  deterministic, 
or  stochastic  signals  can  he  chosen,  and  clearlv,  a  large  number  of 
special  cases  can  be  studied  in  a  common  framework. 

2 . 3  Deterministic  Signal  Representations 
2.3.1  Fourier  Series 

Next,  consider  the  problem  of  obtaining  numerical  representations 
for  signals  belonging  to  a  separable  Hilbert  space  H.  Since  we  are  onlv 
considering  these  Hilbert  spaces,  it  is  possible  to  find  an  orthonornal 
basis  such  that  everv  element  of  H  has  the  representation 

x  =  i  x  $  (2.3-1  ) 

k  k  K 

Bv  definition,  the  orthonormal  svstem  is  called  an  orthonormal 

basis  for  H,  and  each  complex  scalar  xj<  is  called  a  Fourier  coefficient 
of  x.  The  Fourier  coefficients  are  directlv  related  to  x  through  a 


simple  inner  product  operation: 
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The  Sequence  ol  Fourier  I-  Iftli.ients  t  X'K  ;  1  14  culled  the  re  presen  t  .1 1  1  '  It 

<•>  t  x  with  res  nett  to  ($),. }. 


J.'J.J  !  ’Uerpret  at  i  on  .it  Oncr.i  1  i  zed  Fourier  Series  Fxpausi  ms 


'Tt  hnnorna  i  expansions  ut  deterministic  and  storhast  i  c  signals 
provide  the  fundanent.il  connection  hetw<*eii  analoii  signals  and  their 
1  i  it  i  t  a  1  representations.  For  example,  if  s(  •  )  is  a  ileterni  ni  st  i  s  i  ,>nu  1 
handlimited  to  the  interval  (-3,  j)  on  the  angular  trenuem-v  ixis,  then 
s { • )  an  he  expanded  in  terns  of  a  generalized  Fourier  seri »  s 


s  (  t  ) 


s(nT) 


sin  o ( t  -  nT ) 


j  t  t  -  nT ) 


'  J.  > 


The  Fourier  .  oe r f i c i ent s  are  samples  ol  s(»),  Frequent Iv,  it  is  often 
more  convenient  to  work  with  die  samples  <  s(nT)}  than  the  actual  signal 
s(*l.  Moreover,  it  can  he  shown  that  I. a  is  unitarilv  equivalent  ( 2" i 
with  tfie  space  d  of  squa re -sunnahl e  infinite  sequences.  This 
isomorphism,  combined  with  the  direct  connection  he  tween  a  si.tn.il  nd 
its  equivalent  1?  representation,  is  the  theoretical  justification  tor 
sampled  data  or  d i sc  re t e- 1 i me  signal  processing.  !’roblens  formulated  in 
terms  of  continuous  tine  or  continuous  aperture  measurements  nav  he 
easier  to  solve  in  their  equivalent  £a  representations. 

The  choice  of  orthonormal  basis  depends  on  the  class  <>t  signals 
the  designer  expects  id  encounter.  Examples  of  bases  frequent lv  used  in 
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that  fir  anv  t  c  T ,  x(  t 


)  is  a  random  variable 


A  stochastic  vector 


process  is  an  S'-d  i  mens  i  ona  1  vector  whose  components  are  stochastic 
processes;  that  is 

x  (  • , • )  *  (x|(*,-l  ...  xv(-,*)]T  (2.4-1) 

where  Xj(*,-'  is  a  stochastic  process  tor  i  =  1,2,...,  N.  A  complex¬ 
valued  stochastic  process  z(*,*)  is  a  special  case  of  a  two-dimensional 
stochastic  vector  process: 

/( t ,  .)  3  x( t ,  +  iv(t  ,  .  i  (2.4-2) 

tor  ill  r  ■  t  *nd  _  r  ... 

"wo  particular  forms  of  the  index  set  T  are  important.  If  T  is  a 
sequence  •  t  \  ,  t  ' ,  t  ,  then  x(  ■  ,  •  )  is  called  a  discrete-time 

stochastic  process.  in  the  other  land,  if  T  is  an  interval  of  the  real 
line,  >:!*,•>  is  called  a  con  t  i  nuous  - 1  i  me  stochastic  process.  In  this 
dissertation,  we  will  have  the  occasion  to  use  both  tvpes  of  stochastic 
•’  r  oce  s  se  s  . 

...  •  a  non  l  a  1  e  present  at  1  vis  and  Karhunen-l.oe  ve  expansions 

The  rigorous  definition  ..f  a  stochastic  process  is  actual lv  quite 
i  hs  t  r i c  t .  It  is  an  uncountable  collection  of  measurable  functions 
defined  on  i  diveo  probability  space.  A  represent  at i on  more  suitable 
for  •unerical  calculations  is  needed. 

Anv  stochastic  process  x(  • ,  •  )  defined  on  the  product  space  T  x  .. 
wit1  cntinuous,  snua  r -  i  n  t  eif  r  a  b  1  e  covariance  kernel  Rx(.  .)  can  be 
ex; anded  i  «r  •  \  dene ra 1 i zed  Fourier  series 


x(  t  ,  iu) 


x  (w)  $.(t) 


(2.4-3) 


where  j^(-)  is  an  orthonormal  basis,  and 


Xi(ui)  »  <  x,  > 


for  all  i  [22).  The  functional  <  *,♦  >  is  the  standard  inner  product 


defined  over  the  space  of  interest.  The  series  coefficients  (xj(*)}  are 


random  variables  whose  numerical  values  depend  on  the  particular  reali¬ 


zation  w  e  fi . 


Fquation  (2.4-3)  can  be  thought  of  as  a  decomposition,  in  which 


function  of  two  variables  is  expressed  as  a  sum  of  products  of  functions 


of  one  variable.  These  decompos i t i ons  are  familiar  from  the  theorv  of 


partial  differential  equations,  and  their  significance  in  this  context 


is  essentially  the  same.  Since  the  basis  functions  { ^  (  * ) }  are 


deterministic,  we  can  replace  the  study  of  an  uncountable  set  of 


stochastic  processes  {x(*  ,*)}  by  a  countable  set  of  random  variables 


rxj(>)).  Moreover,  the  ideas  from  Hilbert  space  theorv  can  be  applied 


to  this  representation  provided  the  inner  product  functional  is  chosen 


appropriately. 


Fquation  (2.4-3)  is  a  generalized  Fourier  series,  hence  anv 


plete  orthonormal  hasis  {$i(*)i  spanning  Lo(a,  b)  can  be  used.  However, 


or.e  series  is  important  enough  to  warrant  a  name.  This  representation 


Is  railed  the  Karhunen-I.oeve  orthonormal  expansion  [25]  [2ft]  of  x(-,-), 


md  its  usefulness  can  be  explained  as  follows.  If  the  unique  Karhunen- 
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Loeve  basis  is  used,  a  nice  "double  orthogonality"  results.  The  basis 
functions  are  orthonornal : 


/  4>.  (t)  $*(  t)  dt  =  6.  . 
t  i  J  i.l 


(2.4-4) 


and  so  are  the  expansion  coefficients: 

F.{ x .  (u>)  x*(w)}  =  X(x)  6.  .  (2.4-5) 

l  j  l  ij 

The  basis  providing  for  properties  (2.4-4)  and  (2.4-5)  can  be  found  bv 
solving  the  integral  equation 

X.  <t>.(t)  =  f  R  (t,  u)  4>  ■  ( u )  du  (2.4-b) 

11  X  1 

where  Rx(t,  u)  =  E{x(t,  u )  x*(u,  ui ) )  is  the  covariance  function  of 
process  x( •  ,  • ) . 

\nother  important  result  is  Mercer's  Theorem  [22],  which  states 
that  anv  positive  definite,  Hermitian,  square-integrable  kernel  k(-,*) 
car.  be  expanded  in  a  series  representation: 

k(t,s)  =  l  \(k)  *  (t)  <f>*(s)  (2.4-7) 

1  '  J  J 

.  (k), 

where  {X^  }  and  { 4>  j  (  • ) }  are  the  eigenvalues  and  eigenfunctions  of 
k(*,*).  This  is  the  spectral  representation  for  k( • , ■ ) ,  one  which  is 
particularly  convenient  for  numerical  calculations.  Since  covariance 


.  I  x.(w)  4>.(t)  [  l  x.(u>)  *  .(u)  *} 
1=1  1  =  1  J  J 


E  {  l  l  x?(u)  x.(w)  $*(t)  <J> .  ( t )} 
i=l  j=l  1  J  1  J 


(2.5-3) 


Since  the  Fourier  series  converges  to  x( • , * )  in  the  mean-square  sense, 
the  expectation  operator  and  the  double  suns  can  be  interchanged  [25]: 


R  (t ,  u )  =  )  1  E{  x .  (to )  x* 

X  1  =  1  j  =  l  1  7 


*(w)}$.(t)  $*(u) 
.1  1  J 


l  I  r .  .  $ . ( t )  $*(u  ) 

iii  j=i  17  1  7 


(2.5-4) 


The  double  sum  is  conviently  written  in  vector -matrix  notation: 


Rx(t,u)  =  j>H(t)  Rx  Hu) 


(2.5-5) 


where 


4>(t)  =  [  4>  ]  ( t  )  i>2(t)  ...  ]T 


and  _Rx  is  the  infinite  matrix  of  correlations.  It  is  easy  to  show  that 
Rx  is  Hermitian,  which  implies  that  _Rx  has  a  unique  eigenvalue  decompo¬ 
sition  [22 1  : 


Rx  =  u  y  jjh 


(2.5-6) 
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Then  Rx(*,*)  can  be  written 


Rx(t,u)  =  ^  ^J(u) 

=  $_H(t)  U  _A  ’  UH  _*(u) 

00 

=  I  A^X)  *k(t)  <(u)  (2.5-7) 

k=l 

The  {?>!<(  * ) )  functions  are  a  linear  combination  of  the  {$]<(•)}  basis: 

00 

V°  =  l  \i  Vc)  (2*5"8) 


They  are  orthonormal: 


* 
u . 
in 


♦„<'»(  i, 

n  =  1 


u*  <!>  (t)]*dt 
jn  n  J 


00 


=  /  I 

T  m=  1 


u*  u.  <?>  (t)  $*(t)  dt 
lm  jn  m  n 


\ 

L 

n=  1 


oo 

l  u?  u.  /  $  ( t )  $*( t  )  dt 
im  in  '  m  n 


=  y  u*  u .  =  5  .  . 

lm 


since  the  rows  of  _U  are  orthogonal.  Furthermore,  the  { <J>  k  (  * ) }  functions 
are  complete,  since  each  is  a  linear  combination  of  a  complete  ortho¬ 
normal  basis.  Therefore,  the  {$(<(•)}  set  is  a  complete  orthonormal 


I/»I. 


■t.  it.M.  iL  ^i.  »u  >» 


basis,  and  since  the  basis  which  diagonalizes  the  matrix  representation 

*» 

of  Rx( • , • )  is  unique,  Che  functions  are  the  Karhunen-Loeve  basis,  the 
representation  (2,5-7)  is  the  spectral  representation  of  Rx(*>*)>  and 
the  parameters  {A^  }  are  the  eigenvalues  of  integral  equation  (2.4-6). 

v 

In  principle,  this  allows  the  Karhunen-Loeve  basis  and  expansion 
coefficients  to  be  obtained  from  an  arbitrary  expansion,  provided  the 
second  order  statistics  of  its  Fourier  coefficients  are  known.  The 
Karhunen-Loeve  basis  is  calculated  with  Equation  (2.5-8),  and  the 
uncorrelated  expansion  coefficients  are  given  by  the  transformation 


x  (uj )  =  JJH  5c(w) 
KL 


(2.5-9) 


where  x  (•)  and  x( • )  are  column  vectors  containing  the  expansion 
KL 

coefficients.  This  transformation  decorrelates  the  coefficients,  since 


E{xkl(oj)  x^uj)}  =  JJH  E{x(w)  x%)}U  =  diag  [xjx)  X(2X>  ...  ] 


In  Chapter  6,  we  will  show  how  to  calculate  the  expansion  directly  from 
data  with  numerically  stable  algorithms. 


2.5.2  Example 

This  example  is  based  upon  a  measurement  model  used  frequently  by 
Nolte  and  his  colleagues  [27]  [28],  Consider  a  time-limited  stochas¬ 
tic  process  measured  over  the  interval  (-T0/2,  T0/2).  Then  x( • , • )  can 
be  expanded  into  the  Fourier  series 
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c(t,aj)  =  £  x  (w)  e^o01" 


(2.5-10) 


where 


1 


Xn(u)  (2ttT  )  1/2 

o  T 


/  x(t,w)e  jaJont  dt 


In  order  to  simplify  the  representation  of  x(  * , • ) ,  they  made  the  addi¬ 
tional  assumption  that  x( • , • )  is  essentially  bandlinited  to  ±  Lw0 
radians/sec.  Then 

L 

I 

n=-L 


x(t  ,u) 


y  x  (w)e^tJont 
L  n 


(2.5-11) 


Of  course,  the  expansion  coefficients  (xn(*)}  are  correlated.  Can  the 
Karhunen-Loeve  eigenfunctions  and  expansion  coefficients  be  obtained 
from  Equation  (2.5-11)? 

The  covariance  operator  Rx(*,*)  is 


L  L 

Rx(t,u)  =  E{x(t,u>)  x*(u,w)}  =  E{  l  xk(ui)  Xk(t)  [  l  x*(u>)  xt(  t )  ] } 

k=-L  Z=-L 


l  l  E{x  (oi)  x*(u>)}  xk(t)  Xj(u) 
k=-L  Jt— L  1  1 


L  L  ... 

V  \  cf  /  \  A/  \  ^  j  cu  k  t  j  cu  £  u 

l  l  E{  x  (a) )  x  ( u> ) }  e  o  e  J  o 

k=-L  £=-L  K  1 


(2.5-12) 
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In  vector-matrix  notation,  the  Hermitian  form  (2.3-12)  is  given  by 


Rx(t,w)  -  j£^(t)  X 


(2.5-13) 


where 


T ,  .  ,  -joj  Lt  iw  Lt , 

X  (t)  =  (e  o  ...  eJ  o  ] 


is  a  (2L  +  1)  x  1  column  vector,  and 


RxX  =  E{x(di)  xH(w)} 


(2.5-14) 


is  the  covariance  matrix  of  the  ( 2L  +  1)  Fourier  coefficients.  Perform¬ 
ing  an  eigenvalue  decomposition  of  _RxX  gives 


Rxx  =  U  Ax  UH 


(2.5-15) 


and  substituting  (2.5-15)  into  (2.5-13)  leads  to  the  result 


Rx(t,u)  =  XT(t)  II  Ax  UH  X*(,l)  =|T(t)  Ax  **(u)  (2.5-16) 


It  is  easy  to  see  that: 


_£(u)  =  JjT  j^(u) 


(2.5-17) 


Rewriting  (2.5-16)  in  terms  of  a  sum  yields: 


2n+l 

R  (t ,u  )  =  l 
x  u 

n  =  l 


A(X)  *  (t)  #  *(  u  ) 
n  n 


n 


(2.5-18) 
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where  <f  (•)  is  the  nth  element  of  the  column  vector  $>(*),  and  moreover, 


is  a  linear  combination  of  the  complex  exponential  terms.  Equation 
(2.5-18)  is  the  result;  namely,  it  is  the  spectral  representation  of 
Rx( • , • ) .  Since  the  spectral  representation  is  unique,  the  functions 
■f  ( * ) }  are  the  Karhunen-Loeve  basis,  and  the  Karhunen-Loeve  expansion 
of  x( • , • )  is 


2N+1 

x(  t  ,uj )  =  l 


xKLU)  *n(t> 


The  {x  (•)}  coefficients  can  be  obtained  directly  from  (2.5-10)  by  a 
KL 


linear  transformation: 


_xKL(w)  =  U  >c(u)) 


where 


2Ckl(w)  =  [x ^  ( co ) 


x2n+1(u))]T 


_x(u>)  •=  [x  (u) 


x_L(w) ]T 


The  covariance  matrix  can  be  estimated  from  measured  data  by 

averaging  over  several  sets  of  measurements.  This  procedure  will  be 
discussed  in  Chapter  6. 
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2 . 6  Deterministic  and  Stochastic  Operators 


2.6.1  Deterministic  Operators 

Given  Hilbert  spaces  H[  and  H2*  A  function  L( • )  which  maps  H[  into 
H2  is  called  a  linear  operator  if,  for  all  x  and  y  in  Hj,  and  complex 
scalars  a: 

L(x  +  v)  =  L(x )  +  L(y) 

L(ax)  =  ctL(x) 

For  convenience,  we  will  write  Lx  instead  of  L(x).  We  will  often  write 
Lx(t)  or  Lx(t,co)  for  x  belonging  to  L2  (deterministic  signals)  or 
L2  x  Q  (stochastic  signals)  respectively.  Since  systems  of  practical 
interest  are  stable  in  the  hounded-i nput ,  bounded-output  sense  [29),  we 
need  onlv  be  concerned  with  hounded  linear  operators.  Mathematical 
representations  for  bounded  linear  operators  shall  be  discussed  in  the 
next  chapter. 

2.6.2  Stochastic  Operators  and  Their  Representations 

Clearly,  deterministic  descriptions  of  many  signal  transformations 
encountered  in  practical  problems  are  inadequate.  For  example,  wave 
propagation  through  realistic  scattering  media  is  described  by  partial 
differential  equations  whose  coefficients  are  stochastic  processes. 
Obviouslv,  deterministic  operator  representations  of  these  transforma¬ 
tions  are  insufficient,  and  more  general  representations  based  on 
probabilistic  notations  are  required. 

A  stochastic  operator  £(•)  is  a  mapping  defined  over  a  Hilbert 
space  H  which  is  indexed  with  respect  to  a  variable  oo  belonging  to  a 
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probability  space  (f2,  F,  P)  [4],  An  example  of  a  linear  stochastic 
operator  defined  over  is  an  N  x  N  matrix  whose  elements  are  random 


variables.  to  could  represent  the  value  of  an  unknown  physical  quantity, 
such  as  the  velocity  of  wave  propagation  in  a  medium. 

A  useful  representation  for  stochastic  operators  defined  over  1*2  is 
the  random  Green's  function  representation.  This  description  can  be 
justified  on  physical  grounds  since  one  can  demonstrate  that  solutions 
to  the  stochastic  wave  equation  are  written  in  this  form  [30]  [31]. 
Furthermore,  models  for  range  spread,  Doppler  spread,  and  double  spread 
channels  can  also  be  expressed  as  random  Green's  functions  [1], 

A  random  Green's  function  representation  for  £(*)  is  given  by: 


y(t,  to)  =  £x(  t ,  to)  =  /  h(t,  t,  to)  x(t,  to)  di 


(2.6-1) 


The  random  Green's  function  h( •,*,•)  is  the  impulse  response  function 
for  a  linear  stochastic  system,  and  is  a  generalization  of  the  impulse 
response  function  for  a  linear,  time-variant  deterministic  system. 

This  description  of  the  medium  establishes  a  connection  between  the 
physical  description  of  realistic  scattering  media  and  the  system 
theoretic  representations  familiar  to  engineers  and  applied 
mathematicians.  A  more  detailed  discussion  of  this  approach  to  linear 


stochastic  system  modeling  is  contained  in  a  recent  monograph  by  Adomian 


2.7  Conclusions 


We  have  established  the  mathematical  structure  which  will  he  used 
throughout  the  dissertation.  Deterministic  and  stochastic  processes 
will  be  represented  by  generalized  Fourier  series  expansions.  Signal 
processing  operations  and  stochastic  transmission  media  will  be  modeled 
bv  bounded,  linear  operators  defined  over  Hilbert  spaces  of  determinis¬ 
tic  and  random  signals. 

The  Karhunen-Loeve  expansion  was  defined  and  discussed  in  some 
detail.  Its  properties  will  be  exploited  later  when  the  implementation 
problem  is  studied.  This  expansion  can  be  obtained  from  arbitrary 
expansions  by  linear  transformations  of  the  basis  functions  and 


MATR I X  RF.  PRESENTATIONS  OF  PROPAGATION  AND  SCATTERING  OPERATORS 


3 . 1  Int  roduct ion 

We  shall  begin  hv  examining  the  channel  modeling  problem  in  detail. 
In  this  chapter,  the  goal  is  to  derive  mathematical  representations  tor 
those  deterministic  and  random  transformations  frequent  1 v  encountered  ip. 
practical  arrav  processing  problems.  We  are  especial lv  interested  in 
developing  numerical  representations  that  are  convenient  for  digital 
signal  processing  applications,  and  that  can  he  incorporated  natural lv 
into  the  estimator-correlator  structure.  The  approach  is  based  on 
matrix  representations  of  bounded,  linear  operators. 

This  chapter  contains  several  new  results.  The  matrix  represen¬ 
tations  for  de termini s t i c  operators  defined  over  the  Hilbert  space  of 
handlimited  signals  were  introduced  by  Si  bul  and  the  author  (32). 
Furthermore,  the  results  have  interesting  rami f i cat  ions  in  the  theory  of 
nnn-stationarv  stochastic  processes. 

We  obtain  novel  representations  for  vector-valued  deterministic  and 
stochastic  signals,  and  for  operators  defined  over  these  Hilbert  spaces. 
Matrix  representations  for  beamforming  and  multipath  propagation  chan¬ 
nels  are  derived.  The  model  for  the  multipath  channel  Is  a  generaliza¬ 
tion  of  a  model  which  recentlv  appeared  in  the  literature  [33], 

Section  3.2  introduces  the  fundamental  ideas  which  shall  be  used 
throughout  the  chapter.  In  the  next  section,  we  introduce  matrix  repre¬ 
sentations  for  hounded  linear  operators,  and  show  how  thev  are  useful 


tor  d  i  sc  re  t  e-t  i  me  nrnessim;  applications.  To  Ccnonst  rat e  their  use¬ 
fulness  ,  natrix  representations  for  delav,  tine  stretching/ 


.  T'T.vy 


compression,  and  simultaneous  tine  delav  and  stretching  given  hand 
1  ini  ted  signals  will  he  worked  out. 

’.'ext,  natrix  representations  tor  stochastic  operators  will  be 
developed.  The  representations  are  equivalent  to  those  derived  for 
deternini stic  t  rans  formations ,  and  ran  he  used  to  model  spread 
scat  t  e  r i nr  ned i a . 
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In  Section  1 .  ,  natrix  representations  tor  vector-valued  processes 
will  he  studied  in  some  detail.  It  will  be  seen  that  convenient 
>rthonornal  bases  can  he  constructed  from  simpler  functions.  Matrix 
representations  tor  operators  defined  over  these  spares  are  constructed. 
Actual lv,  in  general,  the  representations  for  these  operators  nust  be 
regarded  as  tensors,  since  thev  are  indexed  with  respect  to  lour 
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indicies.  Fortunatelv,  if  the  basis  is  chosen  judiciously,  the 


operator  representations  can  he  greatlv  simplified. 

Pinal  lv,  natrix  representations  of  multipath  channels  are  derived. 
The  result  is  a  generalization  of  a  model  which  recently  appeared  in  the 
I  i t era tu re  . 


-r.V.V. 

Vv'nV.I 


Deterministic  Operators 


3.2.1  Flenentarv  Transformations 


Let  us  define  several  elementary  signal  processing  t  rans  format  ions , 
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beginning  with  propagation  delav: 


r(t)  =  s(t  -  t),  t  >  0 
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In  the  absence  of  noise,  the  received  signal  is  a  delaved  version  of  the 
transmitted  signal  s ( •  ) . 

Tine  stretching/conpression  is  a  generalization  of  Doppler  shifing 
which  applies  to  wide  band  signals.  It  is  defined  bv 

r ( t )  =  s(at ) 

for  a  >  0.  Simultaneous  stretching  and  delav  is 

r(t)  =  s(a(t  -  i)) 

a  combination  of  the  two  previous  transformations. 

These  ideas  can  be  generalized  in  a  slightly  more  abstract  setting. 
If  s(*l  represents  a  finite  energy  signal,  then  s(*)  and  r( • )  can  be 
represented  as  elements  in  L2  or  an  appropriate  subspace  of  L2 ,  which 
implies  that  the  transformations  defined  above  can  be  represented  as 
linear  operators  napping  elements  from  an  input  space  into  an  output 
space.  The  operators  are  defined  implicitly: 

r(  t )  =  Axs  ( t )  =  s(  t  -  r  ) 

The  stretching/compression  operator  Aa( • )  is: 

Aas(t)  =  s(at) 

Simultaneous  stretching  and  delav  is 


As  s(t)  =  s(a(t  -  t)) 


This  operator  is  a  cascade  of  Aa( • )  and  AT(*): 

As  (  *  )  =  A^  A-j  (  •  ) 

It  is  easv  to  show  that  all  three  operators  are  bounded  and  linear. 

An  explicit  representation  for  a  linear  operator  £(•)  can  be 
defined  by  a  Green's  function  representation: 

00 

£s  ( t  )  =  /  h( t  ,  t  )  x( t )  dt 

—CO 

which  is  a  convolution  integral  for  a  linear,  tine-variant  svsten.  The 
use  of  this  representation  in  signal  processing  applications  is  well 
established  [  1 ]  [ 34 ) . 

1.2.2  Matrix  Representations  of  Bounded  Linear  Operators 

The  operators  defined  in  the  last  section  arp  useful  in  continuous 
time  signal  processing  problems.  However,  for  digital  signal  processing 
purposes,  eauivalent  operators  for  sequences  are  needed.  Furthermore,  a 
representation  which  unifies  the  so-called  implicit  and  explicit  defini¬ 
tions  presented  above  is  highlv  desirable. 

Ue  propose  using  matrix  representations  of  the  continous  time 
operators.  The  definition  begins  bv  expanding  r( • )  and  s(*)  into 
generalized  Fourier  series: 


The  coefficients  {<  r,  <*>k  >}  and  {<  s,  >}  are  the  Fourier  coeffi¬ 
cients  of  r( • )  and  s(*)  with  respect  to  the  basis  (Ok) *  Then  the  matrix 
representation  Uij]  of  an  operator  £(♦),  where 


r(t)  =  £s(t) 


is  found  by  calculating  the  inner  product 


ii j  -  <  £$j ,  > 


(3.2-1) 


The  Fourier  coefficients  of  r  are  obtained  by  evaluating  the  sum 


r .  =  >  l .  .  s  . 
i  j  iJ  J 


Of  course,  the  model  parameters  { 2.  j }  depend  on  the'  choice  of  basis 
functions  used  to  model  the  signals.  Suitable  choices  for  practical 
applications  include  sampling  functions,  complex  exponentials,  prolate 
spheroidal  functions,  the  standard  basis,  and  Karhunen-Loeve  eigen¬ 
functions.  Some  important  properties  of  matrix  representations  are 
independent  of  the  specific  basis.  This  allows  the  modeling  and 
identification  problems  to  be  examined  from  a  fundamental  and  unified 
perspective. 


3.2.3  Fxample:  Time  Delay 

Let  us  work  several  specific  examples  to  illustrate  the  preceeding 
ideas.  Suppose  that  s(*)  is  a  band  limited  signal  whose  Fourier  trans¬ 
form  vanishes  outside  the  frequency  interval  (-o,  o).  Then  s(*)  has  the 
following  generalized  Fourier  series  representation: 
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s(t)  =  l  s(nT) 


sin  o(t  -  nT) 


c(t  -  nT) 


(3.2-2) 


where  T  is  the  sampling  interval.  If  the  sine  functions  are  normalized 
by  the  factor  1//T  ,  they  form  an  orthonormal  basis  for  this  subspace  of 
L2.  The  Fourier  coefficients  are  simply  the  samples  of  s(’). 

Recall  that  the  delay  operator  was  defined  bv  the  relation: 


ATs(t)  =  s ( t  -  t) 


If  s(*)  has  representation  (3.2-2),  what  is  the  matrix  representation  of 
AT( • )? 

From  the  result  presented  in  Equation  (3.2-1): 


a  =  <  A-r  <}>  ,  4> 

mn  1  n  m 


oo 

>  "7  / 


sin  o(t  -  t  -  nT)  sin  o(t  -  mT) 


o(t  -  T  -  nT) 


o ( t  -  mT ) 


dt  (3.2-3) 


This  integral  is  evaluated  in  the  Appendix  using  contour  integration. 
The  result  is: 


sin  j(t  -  (m-n)T) 


o(t  -  (m-n)T) 


(3.2-4) 


To  check  if  the  answer  is  reasonable,  suppose  the  delay  x  is  an  integer 
multiple  of  the  sampling  period  T.  Then 


sin  q(kT  -  (m-n)T)  _  sin  ir(k  -  ( m-n) ) 
o(kT  -  (m-n)T) 


ir(k  -  (m-n)) 


which  implies  that 
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m-n  *  k 


(3.2-5) 


m-n  =  k 


and  the  matrix  representation  of  AT(*)  reduces  to  a  shift  matrix 


analogous  to  the  Z  matrix  introduced  by  Kailath  and  his 


colleagues  [  35] . 


3.2.4  Digression:  Application  to  Non-stationary  Random  Processes 


The  matrix  representation  of  delay  operator  AT(*)  has  some  inter¬ 


esting  ramifications  in  the  theory  of  non-stationary  stochastic  proces¬ 


ses.  In  an  attempt  to  measure  "how  far"  an  arbitrary  discrete-time 


stochastic  process  deviates  from  wide  sense  stationarity ,  Kailath  and 


his  colleagues  introduced  the  concept  of  displacement  rank  [36]. 


The  displacement  matrix  6R  is  defined  by: 


SR  =  R  -  Z  R  ZT 


(3.2-6) 


where  R  is  the  infinite-dimensional  covariance  matrix  of  the  stochastic 


process  {x( *,*)},  and  Z  is  the  lower  shift  matrix  given  by  Equation 


(3.2-4)  for  k  =  1.  They  argued  that  the  rank  of  6R  is  a  measure  of  how 


far  x( • , • )  deviates  from  wide-sense  stationarity. 


The  Z  matrix  can  be  interpreted  as  the  matrix  representation  for 


Ax(«)  with  respect  to  the  standard  basis.  Calculating  the  matrix 


elements  in  this  case  is  trivial;  however,  only  integer  multiples  of  the 


sampling  interval  can  be  represented.  Our  representations  allow  these 


ideas  to  be  generalized  to  arbitrary  time  shifts. 
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Ass(t)  =  s(a(t  -  t)) 


which  is  the  combination  of  delay  and  stretching.  We  have  already  shown 
that  Ag(*)  is  equivalent  to  cascaded  Aa( • )  and  AT(*): 

Ag  (  *  )  =  Aa  A-f  ( • ) 

Its  matrix  representation  can  be  calculated  in  two  ways.  The  first  is 
by  the  original  definition: 

_1_  f  sin  a(qt  -  at  -  nT)  sin  o(t  -  mT) 

3mn  T  '  a(at  -  at  -  nT)  a( t  -  mT) 


with  the  result 


a 

mn 


sin  j(ar  -  (am  -  n)T/ct) 
a( at  -  (am  -  n)T/a) 


(3.2-8) 


The  second  is  by  cascading  the  matrix  representations  of  As(*)  and 
AT(-).  This  can  be  performed  by  multiplying  the  two  infinite  matrices 
together: 


a 

mn 


(a,r) 


l  a  (x)  a  (a) 

j=lco 


(3.2-9) 


where  the  (araj(*)}  are  the  delay  operator  matrix  elements,  and  the 
{a  jn ( “ )  J  are  t*16  stretching/compression  elements.  From  Equations 
(3.2-3)  and  (3.2-7),  we  want  to  calculate 


This  sum  can  be  evaluated  by  rearranging  Equation  (3.2-10)  and  applying 
Shannon's  Sampling  Theorem.  Rearranging  (3.2-10)  yields 


sin  gT(j  -  n/q)  •  sin( g( i  -  (m  -  j )T) 


j=- 


aT( j  -  n/a) 


o(t  -  (m  -  j )T) 


(3.2-11) 


Let  k  =  m  -  j .  Then  the  sum  becomes 


sin  aT(m  -  k  -  n/a)  sin  a(r  -  kT) 


aT(m  -  k  -  n/a)  a(r  -  kT) 


V  sin  a(mT  -  kT  -  nT/a)  sin  a(i  -  kT) 
_  a(mT  -  kT  -  nT/a)  o(t  -  kT) 


(3.2-12) 


Now  for  bandlinited  f(-), 


f(t)  =  J  f(kT)  Sln  g(t  ~  kT) 
n  1  .  L  '  o(t  -  kT) 

k  =  -<o 


We  immediately  equate 


cn.v\  -  sin  ^(mT  -  kT  -  nT/a)  _  sin  a(kT  -  mT  +  nT/a)  , „  ,  ,, 

^  1 .  rp  _  'T*  /  _  \  _  /  i  rr>  rr.  rr.  /  \  \  J  9  A  L  i  ) 


a(mT  -  kT  -  nT/a) 


a(kT  -  mT  +  nT/a) 


which  is  bandlimited  to  (-ao,  aa).  Therefore,  the  right  hand  side  of 
(3.2-12)  is 


sin  q( t  -  mT  -  nT/a)  _  sin  cr(ar  -  (am  -  n)T)/a) 
a(t-mT-nT/a)  a(at-(am-n)T)/a) 


which  equals  the  result  obtained  by  evaluating  inner  product  functional 
directly. 
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This  example  demonstrates  that  cascading  linear  operators  defined 
over  L2  spaces  and  their  matrix  representations  is  equivalent.  Compu¬ 
tationally  speaking,  it  is  usually  easier  to  determine  the  matrix 
representation  of  cascaded  operators  directly  from  the  definition  rather 
than  by  multiplying  large  or  infinite  matrices.  It  is  interesting  to 
note  that  by  applying  Equation  (3.2-9)  to  cascaded  operators,  we  obtain 
a  method  for  computing  products  of  infinite  matrices. 

3.3  Matrix  Representations  of  Stochastic  Operators 
3.3.1  General 

Two  fundamentally  different  matrix  representations  are  associated 
with  stochastic  operators:  Those  characterizing  £(•)  itself,  and  those 
representing  its  statistical  properties.  Both  are  useful  in  signal  and 
array  processing  applications.  In  this  section,  we  shall  examine  only 
the  first  type  of  representations.  Those  defined  for  statistical 
measures  of  £(•)  will  be  examined  in  Section  3.3.3. 

The  matrix  representation  of  a  stochastic  operator  £(•)  with  re¬ 
spect  to  a  basis  {<(1^}  is  defined  exactly  as  in  the  deterministic  case: 

2.  i  j  (  oj  )  =  <  £4>j  ,  >  (3.3-1) 
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Each  matrix  element  {£ij(*)}  is  a  random  variable  whose  numerical  value 
depends  upon  the  particular  realization  u)  of  £(•). 

3.3.2  Examples 

Suppose  £(•)  is  defined  in  terms  of  a  random  Green's  function 


v 


representation : 


£( • )  =  J  h( t ,  x ,  w)( • )  di 


The  matrix  representation  of  £(•)  is: 


CO  oo 

=  <  £<(>.»  <j)  >  =  /  {  /  h(t,  x,  ui)  4> .  (  t  )  dr}  $*(t)  dt 

1  3  1  -OO  -00  3 


=  /  /  h(t,  t,  w)  $.(x)  4>  *  ( t )  dx  dt  (3.3-2) 

— oo  ^ 

Notice  that  the  tine  dependence  of  h( •,*,•)  does  not  appear  in  its 
matrix  representation.  Therefore,  we  have  a  means  of  representing  a 
time-varying  system  with  coefficients  that  are  tine-invariant. 

The  random  Green's  function  representation  can  he  used  to  model 
range  spread  and  double  spread  media.  For  example,  a  signal  y(*,«) 
returned  from  a  range-spread  channel  can  be  expressed 

y(t,  ui)  =  /  b  (X,  oj)  s ( t  -  X)  dX 
L  R 

where  b^( •  ,  • )  is  a  sample  function  from  a  zero  mean  complex  Gaussian 
random  process,  and  X  is  the  spatial  variable.  Its  matrix  representa- 


^i  j  (w )  —  ^  £4>  j  .  t i  ^ 

OO 

=  /  {  /  b  (X,  w)  <J>  .  ( t  -  X)  d  X }  $*(t)  dt 


tion  is 


=  /  /  b(t  -  X/2,  X,  u)  -  X)  (J>* 


.(t)  dX  dt 


(3.3-4) 


3.3.3  Matrix  Representations  of  Covariance  Kernels 

We  will  find  that  the  matrix  representations  of  integral  covariance 
kernels  provide  useful  insights  into  several  problems  which  will  be 
studied  in  this  work.  Their  derivation  is  straightforward,  since  it  can 
be  shown  that  if  { 4>k. (  “ ) ^  is  a  complete  orthonormal  basis  spanning 
L2(T),  then  the  functions 


Vc’  s)  =  *k(t)  Vs) 


are  a  complete  orthonormal  basis  spanning  the  product  space  L2(T)  x 
L2(T)  [22].  This  is  the  space  of  all  square-integrable  functions 
k(*,*),  and  since  we  have  assumed  all  stochastic  processes  have  square- 
integrable  covariance  kernels,  they  belong  to  this  space.  Therefore,  in 
terms  of  the  basis  { ‘J’k.j, (*»*)}  , 


R  (t,  s)  =  l  l  <  R  ,  $  >  <fr.  (t)  $*(s) 

X  k=l  i=l  X  *  1  K  l 


(3.3-5) 


where 


<  Rx’  ^k  >  =  ft  Rx(t*  s)  't>k(t)  ds  dt 


(3.3-6) 


Equations  (3.3-5)  and  (3.3-6)  can  be  expressed  in  terms  of  expansion 
coefficient  cross-correlations ,  because 
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Rx(t,s)  =  E{  x(  t ,  w)  x*(s,  oj)} 


By  exchanging  integration  order  in  Equation  (3.3-6), 

<R  =  E{  j  f  x  ( t  ,  w)  x*(s,  u>)  <j>*(t)  <j>  (s)  ds  d  t } 

^  K  x,  XT  K  x 

=  E{[J  x(t,  w)  <f>*(t)  dt][/  x*(s,  u)  <t>  (s)  ds]} 

T  T 

=  E{  x^(o)  )  x*(w)} 

Therefore,  Rx( • , • )  can  be  written  in  the  form 

Rx(t,  s)  =  $H(s)  R*  *(t)  (3.3-7) 

where 

£T(t)  =  [<*»  i  ( t )  <|>  2 ( t )  •  •  .  ] 

and  the  infinite  matrix  is  a  matrix  of  Fourier  coefficient  cross¬ 
correlations.  Rjj  io  the  matrix  representation  of  Rx(*,*)  with 
respect  to  (♦it(*)}.  Notice  that  when  the  Karhunen-Loeve  basis  is  used 
the  off-diagonal  terms  of  lx  vanish.  In  other  words,  the  matrix  repre¬ 
sentation  of  Rx(*,*)  is  diagonalized. 

3.4  Representations  for  Vector-Valued  Processes  and  Transformations 
3.4.1  Signal  Representations 

Numerical  representations  of  vector-valued  signals  are  essential 
for  array  processing  applications.  Therefore,  we  next  take  up  the 


--  r . 


problem  of  representing  signals  x( • )  defined  over  the  product  space 
L2(a,  b)  x  where  the  interval  (a,  b)  is  the  index  set  of  continuous 
time  variable  t,  and  is  the  set  of  conplex-valued  N-tuples  repre¬ 
senting  N  channels  of  data.  We  seek  equivalent  l 2  representations  of 
this  data. 

The  first  step  in  solving  the  problem  is  defining  a  suitahle  inner 
product  functional  over  the  Hilbert  space  l^Ca,  b)  x  C^.  i'ne  such 
functional  is 


<  x,  _v  >  =  J  xT(t)  v*(t)  dt  (3.4-1) 

a 

which  is  convenient  for  signal  processing  applications,  because  the 
induced  norm  ||x||  has  the  interpretation  as  total  signal  energy  summed 
over  all  N  channels. 

Provided  a  complete  orthonormal  basis  exists, 

OO 

x(t)  =  )  x.  4>  ( t  )  ( 3.4-2) 

—  . L .  l—l 

1  =1 

and  as  before,  the  Fourier  coefficients  are  given  bv 

x^  =  <  x,  >  (3.4-3) 

for  all  i . 

In  principle,  Equations  (3.4-2)  and  (3.4-3)  can  be  used  to  obtain  a 
numerical  representation  for  x(«).  However,  it  is  not  readilv  apparent 
how  1  suitable  basis  {$_!<;(•)}  spanning  the  product  space  I.nia,  b)  x  O'1 


can  be  found.  -i  method  of  constructing  an  orthonormal  basis  from  those 
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which  span  the  individual  spaces  L2(a,  b)  and  C ^  is  needed.  The 
following  theorem  is  a  generalization  of  a  result  discussed  in  Gohberg 
and  Goldberg  [22],  and  allows  an  appropriate  orthonormal  basis  to  be 
constructed . 

Theorem  III.  1 :  If  (<Pi)  is  an  orthonormal  basis  for  L2(a,  b),  and 
{ui}  is  an  orthonormal  basis  for  C^,  then  the  functions 


!ij  (t )  =  4>i(  t )  Uj 


(3.4-4) 


form  an  orthonormal  basis  for  loCa,  b)  x  c\  for  i  =  1,  2,...,  and 
i  =  1  ,  2,  . . . ,N. 

Proof :  To  demonstrate  that  {$j.j(*)}  is  an  orthonormal  basis,  it 

suffices  to  show  that  any  two  basis  elements  are  orthonormal,  and  that 
<  _g,  >  =  n  implies  that  is  identically  zero  almost  everywhere. 

Orthonormaiitv  follows  easilv: 


<*,„,$  >  =  /  $ .  (t)  $*  (t)  dt 

--kit  — tin  J  — *0  — nn 
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b  b 

f  (t)  e  e*  <$*(t)  dt  =  e  e*  {  $  (t)  $*(t)  dt  =  <5  6. 

'  k  —  l  — m  n  —  l  — n  J  k  n  £m  \ 
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Next,  it  is  necessary  to  show  that  if  g  e  L2(a,  b)  x  C1'  and  if 


<  £.  iij  >  =  ') 


(3.4-5) 


then  =  _0  almost  everywhere, 
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Expanding  Equation  (3.4-5)  gives 


<  g,  .  >  =  /  g T(t)  <f*(t)  e*  dt  =  gT  e*  =  0  (3.4-6) 

-  -ij  -  l  -.1  -i  -J 

ci 

where  _g^  is  defined  by 

b  b 

■  [  /  gj(t)  <t>  *  ( t )  dt  .  .  .  /  g  N  ( t )  <J>  *  ( t )  dt]  (3.4-7) 

a  a 

an  N’-tuple  of  complex  scalars.  Now,  since  { £j }  is  an  orthonormal  basis 
for  CN,  Equation  (3.4-6)  implies  _g^  =  0  for  all  i.  This  means  that 

h 

/  g  (t)  4>*(  t )  =  0  (3.4-8) 

m  l 

a 

where  (3.4-8)  is  the  element  of  jg^ .  (3.4-8)  immediately  implies 

that  for  m  =  1,  2,...,  N,  gm(*)  is  identically  zero  almost  everywhere, 
because  {$£(•)}  is  an  orthonormal  basis  for  L2(a,b).  It  follows  from 
Equations  (3.4-6)  and  (3.4-8)  that  for  (3.4-5)  to  hold  for  all  possible 
i  and  j ,  g  is  identically  0  almost  everywhere.  Therefore,  from  Theorem 
11.3  in  Gohberg  and  Goldberg,  {_$ij(')}  is  an  orthonormal  basis  for  L2(a, 
b)  x  CN. 

Now  that  an  orthonormal  basis  for  L2(a,  b)  x  has  been  con¬ 
structed,  a  representation  for  elements  x(  • )  belonging  to  it  can  be 


developed : 


L4t(t)  ‘  j,  j,  <  L4c4j  >4,"' 

1=1  j=l 


Combining  Equations  (3.4-9)  and  (3.4-10)  yields 


(3.4-10) 
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(3.4-11) 


Now,  this  equation  expresses  v_(  • )  in  terms  of  {$j.j(*)}»  and  of  course, 


v(t)  =  l  l  y. ,  !..(t)  =  l  l  <  y.  ii-  >  *..(t) 
i  i  j  j  j  j  J  J 


(3.4-12) 


Equating  coefficients  <  _v,  >  in  (3.4-11)  to  the  equivalent  represen¬ 

tation  in  Equation  (3.4-12)  implies  that 


I*  ii.i >  =  n <  ^  ikt  > 


or  that 
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where 
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The  example  we  will  work  in  this  section  is  to  find  a  matrix  repre¬ 
sentation  for  a  beansteering  operator  Lg(*): 

Lg  _s  ( t )  =  [s(t  -  t)  ...  s(t  -  (n  -  l)t)]T 


>VvJ.SK 

«  -V 


Lg(*)  represents  the  processing  needed  to  steer  the  response  of  a 
uniformly  spaced  line  array  towards  a  desired  look  direction.  The 


vector  s(*)  is  defined  by 


;(t)  =  [s(t )  .  .  .  s(t) ]T 


for  -it  <  t  <  if.  Assuming  s(*)  is  periodic  with  period  T  = 


s(t)  =  I  *n  e 

n=-oo 


where  the  fundamental  frequency  u0  is  unity. 

Convenient  representations  for  s(’)  and  v( • )  are 
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(3.4-16) 
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The  matrix  representation  of  Lg(  • )  relates  the  {s-jj}  coefficients  to  the 
{ y i j }  coefficients 


no 


=  k=L  til  Sr£ 


(3.4-20) 
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and  the  coefficients  {£kJ£mn}  are  found  by 

^■k£mn  =  ^  kg  _$ran  >  )Lk£  =  4  ♦n  ,£n »  $k  .££  ^ 

The  product  space  L,2(— it,  it)  x  Cfl  is  a  Hilbert  space  with  inner  product 
functional 


<  _x>  v.  >  =  /  >cT(t)  y*(t)  dt 


(3.4-21) 


for  all  elements  r.  and  _v  in  H. 

Let  us  demonstrate  how  a  judicious  choice  of  basis  simplifies  the 
matrix  representation  of  Lg(*).  A  natural  selection  for  the 
functions  is  the  complex  exponentials: 

$k(t)  =  eJkt 

and  the  standard  basis  for  the  {ej<}  vectors  (k  =  1,...,  M).  Then 


it 

i  =  f  L  <p  (t)  4>*(t)  dt  •  ef  e  =6  /  e- 

k£mn  ‘  t„  k  rm  —  £  — n  £n  1 
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6  e~jkT£  /  ej(k  "  m)t  dt  =  e'jkT£  5,  5„ 
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(3.4-22) 
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for  k  =  ...,  -1,  0,  1,...,  and  £  =  1,  2,...,  M.  The  i£  is  the  delay 
introduced  into  the  £th  sensor  measurement. 

Equation  (3.4-22)  shows  how  the  selection  of  basis  simplifies  the 
matrix  representation.  Its  kth  column  is 
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which  is  Che  steering  vector  for  a  narrowband  signal  centered  at  w  = 
ka)0.  This  shows  that  a  narrowband  steering  vector  is  simply  a  special 
case  of  a  more  general  beamsteering  operator  which  applies  to  wideband 


processing . 


3. 4. A  Matrix  Representations  for  Multipath  Channels 

In  the  next  example,  matrix  representations  for  multipath  propaga¬ 
tion  channels  will  be  derived.  The  received  data  are  defined  over  a 
Hilbert  space  L2(a,  b)  x  C^*  and  have  the  canonical  representation 
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r ( t ,  oj)  =  l  l  r.  .  (to)  (j>.  (t )  e  . 
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The  transmitted  signnl  s(*)  will  be  modeled  as  a  deterministic,  scalar, 
periodic  function  with  Fourier  series  expansion 
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How  can  the  representation  {s^}  be  related  to  the  {rj_j  (“ )}  para- 
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meters  in  a  multipath  channel? 

In  a  multipath  environment,  energy  from  a  common  source  arrives  at 
the  array  from  several  directions.  A  model  must  take  three  phenomena 
into  account;  namely,  a  time  delay  ip(*)  common  to  every  path  due  to 
range  delay,  a  random  real-valued  scalar  representing  propagation  loss 
due  to  distance,  and  time  delay  due  to  wavefront  propagation  across  the 
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array  itself.  The  latter  time  delay  depends  on  the  arrival  direction  of 
the  wavefront  and  the  array  geometry. 

Without  loss  of  generality,  the  matrix  representation  will  he  de¬ 
rived  for  a  single  arriving  wavefront.  The  result  is  easily  generalized 
to  multiple  wavefronts  due  to  linearity  of  the  medium. 

In  the  absence  of  noise,  the  array  measurement  at  time  t  is: 


r(t,  w)  = 


b(w)  s(t  -  Tp  -  tj) 


|_b(u)  s ( t  -  Tp  -  Tfl)_j 


( 3.4-23) 


where  b( • )  represents  propagation  loss,  rp  is  the  delay  due  to  distance 
traveled,  and  is  the  time  delay  at  the  ith  sensor  due  to  the  arrav 
geometry . 

Rewriting  Equation  (3.4-23)  in  operator  notation  gives 


r(t,  w)  =  £s( t ) 


whe  re 


£s(t)  = 


0  I  I b(w) 


Lt  0 
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M  s(  t  ) 


and  both  r(*,*)  and  s(*)  can  be  expanded  as: 
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The  matrix  representation  of  £(•)  is  computed  by 


^  ^.k£  ^ 


where  the  inner  product  functional  <  *,*  >  is 
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<  _x  >  >  =  /  x  ( t )  d t 
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Before  proceeding  with  the  calculation,  a  suitable  basis  trust  be 
chosen.  We  have  already  proved  that  a  basis  snanning  LoCa,  b)  x  o'-'  can 
be  constructed  from  products  of  basis  functions  spanning  the  indi¬ 
vidual  spaces.  In  principle,  one  could  choose  anv  arbitrary  {$k}  and 
{e_k}  from  a  wide  range  of  choices.  However,  a  judicious  selection  of 
basis  night  simplify  the  matrix  representation.  Since  s(’)  is  periodic, 
the  natural  choice  for  the  scalar  basis  is  the  set  of  complex  exponen¬ 
tials.  It  is  not  clear  what  constitutes  a  good  choice  for  the  {o^} 
basis,  therefore,  let  us  try  the  standard  basis  first. 

In  terms  of  the  standard  basis. 
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(oj)  =  <  £$  ,$,>=/  (t)  e  )T  $*(t)  e*  dt  (3.4-24) 
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where  Tq  is  the  observation  interval,  and  Tq  =  2tt/cjq, 
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Substituting  (3.4-25)  into  Equation  (3.4-24)  gives 


*kinn(“)  =  6£n  exPt-jmw0Tp}  exp{  "im“oTn}  b(u)  ^  *m(t)  <t’k(t)  dt 

Tn 


(3.4-25) 
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for  m  or  k  integer,  and  i  or  n  =  1 ,  2,...,  M.  The  coefficients  vanish 
if  n  *  k  or  l  *  n,  which  means  that  the  tensor  representation  of  £(•) 
reduces  to  a  simpler  matrix  structure.  Indeed,  the  kth  column  of  the 
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(3.4-27) 


The  column  vector  is  simply  a  steering  vector  for  a  narrowband  signal  of 
frequency  kwo,  meaning  that  the  matrix  operator  can  be  interpreted  as 
an  infinite  set  of  steering  vectors  acting  on  the  individual  frequency 
components . 

In  order  to  generalize  the  result  to  multipath  channels,  the 
representation  for  each  path  is  summed  element  wise,  which  is  permitted 
since  we  have  assumed  a  linear  medium.  The  k-th  column  of  the  matrix 
becomes 


l  b  (w )  e  jkVp.  v 
t-1  1  1  ~kl 


( 3.4-28) 


where  p  is  the  number  of  paths,  t  is  the  delay  due  to  the  i-th  path, 

Pi 

and  {_v^i)  are  the  narrowband  steering  vectors  associated  with  the  k-th 
Fourier  coefficient  of  s(*)  and  the  i-th  wavefront  arriving  at  the 


3.4.5  Discnssi on 

A  special  case  of  matrix  representation  (3.4-28)  has  appeared  in 
the  literature.  Recently,  Paulraj  and  Kailath  [33]  developed  a  multi- 
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path  channel  model  in  the  context  of  an  optimal  beamforming  problem 


Their  model  for  the  arrav  output  r( • , • )  in  the  absence  of  noise  is 

written : 


_r(t,  u>)  -  A  f(u)  s(t  )  (  3.4-29) 

where  s ( • )  is  a  deterministic,  narrowband  signal,  A  is  an  M  x  p  matrix 
of  steering  vectors  representing  the  arrival  directions  of  p  wavefronts, 
and  f( • )  is  a  p  x  1  random  vector  whose  ith  element  is  a  complex  scalar 
representing  the  path  loss  and  phase  shift  in  the  ith  path. 

Let  us  demonstrate  that  their  multipath  channel  model  is  simply  a 
special  case  of  our  model.  Assuming  that  s( •  )  is  a  narrowband  signal 
in  the  form 


s(t) 


jut  t 
e  o 


then  s (  • )  is  a  special  case  of  the  Fourier  expansion  used  in  the  last 
section , 


0  k  *  I 
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and  rearranging  A  f(w)  gives 


A  f(w) 
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From  Equation  (1.4-28),  the  matrix  representation  relating  the  series 


expansion  coefficients  of  s(*)  to  those  or  r(  • ,  •  1  is 


Equating  the  {  f  j  (  -  ) }  terms  with  the  <  bj (  •  >  exp(  -  i  w,,t  p  );  roe  r  f  i  c  i  en  t  s  , 
and  equating  the  steering  vectors  { a  j }  with  {v-jj  demonstrates  that 
Paul  ra  )  and  Kailath's  multipath  model  reduces  to  a  special  case  of 
Fquat ion  (3 .4-28 ) . 


3 .  3  Cone  Ins i ons 

Matrix  representations  of  hounded,  linear  operators  provide  a 
convenient  means  ot  modeling  determini  stir  and  stochastic  signal  trans¬ 
formations  which  arise  in  arrav  processing  problems.  Thev  are  well 
suited  for  digital  signal  processing  applications  since  thev  are  based 
upon  orthonormal  representations  of  those  signals  under  examination.  It 
is  clear  that  selecting  a  basis  judiciously  can  simplify  the  operator 
modeling  problem  a  great  deal. 

'.!e  have  obtained  new  results  that  shed  new  light  into  classical 
signal  theory  issues.  In  particular,  the  representations  for  time  de lav 
operators  gives  new  insight  into  t  he  tlieorv  of  non-stat  i  on.arv  stochastic 
Processes.  of  course ,  our  interest  in  matrix  representations  g >es 
hevond  theoretical  cons i ie ra t i ons  alone,  "be  final  purpose  of  this  work 
is  to  develop  stochastic  and  deterministic  models  that  can  easily  he 
incorporated  into  the  estimator-correlator  for  subsequent  identification 
in  -on  iur.c  t  i  on  with  detection. 

These  comments  lead  to  a  verv  important  issue  which  deserves 
further  Ms.ussion.  \I1  >>♦  the  models  presented  in  this  chapter  are 
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based  on  ortbonormal  series  expansions  of  deterministic  and  random 


-  *  •  * 


processes.  Of  course,  manv  other  methods  of  signal  and  system  modeling 
have  been  extensively  studied  over  the  last  fort v  wars,  especially 
state  space  and  parametric  or  rational  transfer  function  models.  Whv 
has  a  Fourier  series  expansion  approach  to  the  modeling  problem  been 


la 
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selected' 


First,  this  approach  gives  considerable  insight  into  the  nature  and 
structure  of  £(•).  For  example,  we  showed  how  a  he, informing  operator 
for  wideband,  periodic  signals  can  he  represented  as  a  matrix  whose 
columns  can  he  interpreted  as  narrowband  steering  vectors.  This  is  an 
intuitively  pleasing  result,  and  it  gives  physical  meaning  to  an 
abstract  result  that  might  he  completely  missed  if  other  models  we  re 


Second,  these  representations  allow  for  a  great  deal  of  flexi¬ 
bility.  Anv  basis  spanning  the  signal  space  can  be  used,  and  we  have 
already  demons t  rated  that  a  judicious  selection  of  basis  can  simplify 
the  modeling  problem.  Furthermore,  this  allows  the  detection  problem  to 
he  solved  genericallv  for  an  arbitrary  set  or  generalized  Fourier 
coefficients.  This  is  important,  since  discrete  Fourier  transforms  are 
user!  frequently  in  practical  applications. 

Matrix  operators  avoid  difficulties  inherent  with  representing 
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non-s t a t i ona r v  signals  and  systems.  Recall  that  matrix  representations 
of  time-varving  systems  are  themselves  t i me -i nva r i ant .  This  property 
simplifies  the  modeling  and  identification  problems  considerably.  Mn 
the  other  band ,  modeling  time-varving,  stochastic  systems  with  rational 
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transfer  function  representations  is  exceedingly  difficult,  and  in  our 
opinion,  not  roa t hena t ica 1 1 v  mature. 

The  most  important  reason  whv  this  approach  has  been  selected 
relates  to  the  structure  of  the  estimator  kernel  Gi*,*).  Orthonormal 
series  expansions  will  he  used  to  implement  (!(•,•),  and  the  matrix 
representations  for  £(•)  can  he  incorporated  into  the  structure  in  a 
straightforward  manner.  The  Karhunen-Loeve  basis  is  preferred  because 
it  simplifies  the  solution  for  G( • , • ) ,  and  it  represents  a  fundamental 
approach  to  the  stochastic  svstem  modeling  and  identification  problem. 
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Chapter  4 


IMPLEMENTING  THE  OPTIMAL  STRUCTl’RE 


4 . 1  Introduction 

This  chapter  addresses  several  crucial  issues  which  arise  in  the 
process  of  solving  the  estimator-correlator  equations: 

/  _R  ( t  ,  11 )  _0(  u  ,  z  )  du  =  6(  t  -  z  )_I  (4.1-1) 

T 

/  _R  (t,  u)  G(u,  z )  du  =  _Rv(t,  z)  (4.1-2) 

T 


First,  it  is  necessarv  to  establish  a  suitable  mathematical  representa¬ 
tion  for  the  covariance  and  filter  kernels  in  order  to  solve  Equations 
(4.1-1)  and  (4.1-2).  It  is  not  clear  from  the  definitions  of  R  ^ ( *  ,  *  ) , 
V,-),  and  Ry( • ,  • )  alone  how  to  formulate  solutions  for  (>(•,•)  and 
G(,,,)«  Moreover,  obtaining  numerical  representations  which  are 
reasonably  easv  to  manipulate  in  hardware  is  essential  for  adaptive 
realization  of  the  processor. 

Another  issue  which  must  he  resolved  is  more  fundamental.  So  far, 
we  have  presupposed  perfect  a  priori  knowledge  of  the  covariance  ker¬ 
nels.  However,  this  is  an  unreasonable  assumption  in  most  practical 
situations;  therefore,  it  is  necessarv  to  estimate  the  kernels  from 
array  measurements.  Kernels  R  ^ ( • , * )  and  _Rv(  • , • )  can  he  estimated 
directlv  from  arrav  data  using  standard  adaptive  techniques,  '>n  the 
other  hand,  Rv(»,0  can  not  he  estimated  direct Iv  from  arrav  data,  since 
the  signal  component  is  alwavs  obscured  hv  additive  measurement  noise. 
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If  a  priori  knowledge  of  Ry  ( • ,  •  )  is  unknown  or  incomplete,  a  means  of 
estimating  it  in  conjunction  with  detection  nust  be  found.  incomplete 
knowledge  of  Rv ( • , • )  represents  a  fundamental  limitation  which  must  be 
overcome  if  the  optimal  structure  is  to  he  realized. 

Third,  the  identification  scheme  for  _Ry(*,*)  must  be  signal  inde¬ 
pendent.  In  other  words,  once  the  channel  operator  £(•)  has  been 
identified  based  on  probing  signal  measurements,  we  should  he  able  to 
use  the  results  to  estimate  Rv(*,*)  for  an  arbitrary  transmitted 
signal . 

Finallv,  a  means  of  incorporating  a  priori  knowledge  of  the 
scattering  channel  into  the  detector  structure  is  needed.  For  example, 
we  may  know  that  £(•)  belongs  to  a  certain  class  of  channels  character¬ 
ized  bv  one  or  more  parameters.  When  one  or  more  of  these  parameters 
are  known,  or  if  hounds  can  he  placed  on  their  values,  the  abilitv  to 
incorporate  this  information  into  the  processor  would  be  useful. 

We  begin  bv  reviewing  the  Karhunen-hoeve  orthonormal  expansion  and 
its  relationship  to  the  spectral  representation  of  linear  operators. 

This  expansion  is  the  fundamental  tool  which  will  be  used  to  analvze  and 
implement  the  processor.  We  assert  that  it  provides  the  theoretical 
means  to  solve  for  the  processor,  gives  considerable  insight  into  its 
mathematical  structure,  and  establishes  a  link  between  theoretical 
analysis  and  implementation. 

Next,  ve  will  show  how  these  ideas  can  be  used  to  solve  for  the 
processor  structure.  Finding  0( • , • )  is  straightforward.  Solving 
for  the  estimator  branch  is  somewhat  more  difficult,  but  much  more 
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interesting.  Once  again,  a  Karhunen-Loeve  representation  provides  the 
solution  for  G(  • , • ) ,  and  also  suggests  an  implementation  scheme. 
Furthermore,  it  provides  useful  insights  into  the  modeling  and  identi¬ 
fication  problem.  In  Section  4.6,  the  relationship  between  £(•)  and  the 
estimator  kernel  is  derived.  Identification  of  £(•)  is  required  in 
order  to  calculate  the  conditional  mean  estimate. 


4 . 2  Covariance  Kernel  Representations 

The  key  to  solving  Equations  (4.1-1)  and  (4.1-2)  is  the  spectral 
representation  of  covariance  kernels  Rjj(*,*),  Ry (*,*),  and  Rj  ( • ,  •  )  . 

The  relationship  between  the  Karhunen-Loeve  expansion  of  a  process 
x( • , • )  and  the  spectral  representation  of  its  covariance  kernel  Rx(*,*) 
can  be  easily  demonstrated.  Recall  that  Rv(*,»)  is  defined  by: 


?U(t,  u)  =  E{x(t,  ai )  xH(u,  w)} 


(4.2-1) 


for  t  and  u  within  the  observation  interval  T.  Expressing  Equation 
(4.2-1)  in  terms  of  the  Karhunen-Loeve  expansions  gives 


Rx(t,  u)  =  E{[  l  xk( u>)  iR(t)][  l  x  (u>)  ^(u)] 


GO  00 

=  E{  l  l  xk(io)  x*(u>)  $k(t)  ij(u)} 


k=l  £=1 


=  l  l  E{x  (ui)  x*(u>)}  £  (t)  £?(u) 


(4.2-2) 


k=l  1  =  1 
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However,  the  expansion  coefficients  are  uncorrelated,  therefore, 
Equation  (4.2-2)  reduces  to 


u)  =  I  E{lxk(“)|  i  ik<‘u) 

k=l 


m 
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Furthermore,  the  E{|x^(aj)|2}  terras  correspond  to  the  eigenvalues  {X^ j 
of  the  integral  equation  whose  kernel  is  RxC*,*)*  Therefore: 
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00 

Rx(t,u)  =  l  X^X)ik(t)^(u) 


(4.2-3) 


which  is  the  spectral  representation.  This  discussion  shows  how  the 
uncorrelated  expansion  coefficients  simplify  the  numerical  representa¬ 
tion  of  the  covariance  function,  because  they  reduce  a  double  sum 
representation  to  a  single  sum  representation.  Stated  another  way, 
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the  Karhunen-Loeve  basis  diagonalizes  the  matrix  representation  of 
R  v  ( * » * )  • 


4.3  The  Inverse  Filter  Branch 


4.3.1  Derivation 


To  illustrate  how  these  representations  are  used,  consider 
Equation  (4.1-1): 


mm 


/>/*» 


/  JTft,  u )  _Q( u ,  z)  du  =  5(t  -  z)_I 
T 


and  solve  for  the  inverse  filter  Q(*,*). 
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l  lw(t>  !w(z) 


(4.3-5) 


k=l 


It  can  be  shown  that  (4.3-5)  equals  6(t  -  z)_I,  where  6(*)  is  the  Dirac 
delta  function.  In  other  words,  j^(*,*)  and  Q( • , • )  are  inverse  kernels, 
and  the  inverse  filter  solution  is  given  by  (4.3-2). 


4.3.2  Discussion 


Equation  (4.3-2)  is  the  solution  for  £)(•,•)  expressed  in  spectral 
form.  Clearly,  once  the  spectral  representation  of_Rj^(*,*)  is  found, 
calculating  _Q( • , • )  is  straightforward,  because  one  only  needs  to 


(n). 


calculate  the  reciprocals  of  the  eigenvalues  }.  This  simple 


procedure  demonstrates  the  usefulness  of  a  spectral  representation 
approach. 

Although  the  preceding  derivation  for  _Q( • , • )  is  mathematically 
correct,  several  difficulties  must  be  resolved  before  considering  an 
implementation  based  on  this  solution.  First,  we  have  yet  to  discuss 
how  the  eigenvalues  and  eigenfunctions  used  to  represent  _Rjv](  •  ,  • )  are 
found.  Solving  for  them  even  given  perfect  knowledge  of  _Rjj(  •  ,  • )  means 
solving  a  matrix  integral  equation,  clearly  not  a  simple  task. 
Furthermore,  since  _R^( • , • )  is  usually  unknown  a  priori,  the  kernel  must 
be  estimated  from  array  data.  Exactly  how  this  can  be  done  remains  to 
be  seen.  Finally,  the  numerical  issues  involved  with  hardware 
implementation  must  be  examined.  Since  the  calculations  associated  with 
the  preceeding  discussion  are  often  be  performed  on  finite-precision 
fixed  point  hardware,  robust  computational  algorithms  ought  to  be  used. 
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Each  of  these  issues  will  be  addressed  later  in  the  dissertation. 
However,  for  now  let  us  assume  that  these  difficulties  can  be  resolved, 
and  turn  our  attention  to  solving  for  the  estimator  kernel  G( • , • ) . 


4.4  The  Estimator  Branch 
4.4.1  Derivation 

In  order  to  solve  Equation  (4.1-2),  suppose  that  _Ry('.')  and 
RjC*,’)  can  be  expanded  with  respect  to  the  same  orthonormal  basis 

Uk(  * )} : 


00 

R  Ct,  u)  =  l  \f-y)  ±  (t)  <^(u) 


Rj(t,  u)  =  l  A^r;  £k(t)  *"(u) 
k=l 


(4.4-1 ) 


(4.4-2) 


The  basis  is  not  the  same  as  used  to  expand  _Q( • , • ) .  We  claim  that 
G( • , • )  is  given  by 


G(t,  U)  =  l  J^-^U)  <£(u) 


(4.4-3) 


k=l  A 


To  prove  this  assertion,  substitute  (4.4-2)  and  (4.4-3)  into  the  left 
hand  side  of  Equation  (4.1-2): 


/  [  l  A£r)  !k(c)  £  -777  ±£(u>  i^z)]  du 

T  k- 1  1  X  ^ 


(4.4-4) 
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Both  infinite  series  converge  uniformly;  therefore,  (4.4-4)  can  be 
rearranged,  yielding 
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00  OO  ^  ' 

I  I  xkr)  “77yik(t)  1/  !k(u)  i£(u)  dul  *  ±k(z) 

c— 1  J2,—  1  X .  T 


(4.4-5) 


Once  again  appealing  to  the  orthogonality  condition  of  the  basis  func¬ 
tions  leads  to  the  result  that  (4.4-5)  simplifies  into 
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OO 

I  X^y)  lk(t)  lw(z)  =  Iv(t’  z) 


(4.4-6) 


which  proves  chat  Equation  (4.4-3)  is  the  solution  for  G(*,‘). 


4.4.2  Discussion 

The  solution  for  the  estimator  branch  kernel  depends  on  a  crucial 
assumption;  specifically,  the  ability  to  expand  the  covariance  kernels 
jty(*,*)  and  Rj(*,«)  with  respect  to  the  same  basis  (_£k(*)}.  This  is 
equivalent  to  simultaneously  solving  the  equations 


X.  X  J>  (t)  =  /  R  (t,  u)  $  (u)  du 
k  — k  ‘  —v  —V 


(4.4-7) 
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X^r  _$^( t )  =  /  R,(c,  u)  _£k(u)  du 


(4.4-8) 


with  the  same  set  of  orthogonal  eigenfunctions  {_$k(,)}»  From  a  more 
abstract  point  of  view,  this  is  equivalent  to  simultaneous  diagonaliza- 
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tion  of  two  linear  operators  122].  It  is  essential  to  establish  the 
conditions  when  this  is  possible. 

It  turns  out  that  simultaneous  diagonali zat ion  is  trivial  if  the 
noise  covariance  kernel  corresponds  to  the  covariance  function 

of  a  white  noise  process.  Since 


+  jy***)  (4.4-9) 

if 

RN(t,  u)  =  6 ( t  -  u)I  (4.4-10) 

then  by  substituting  Equations  (4.4-9)  and  (4.4-10)  into  Equation 
(4.4-8),  it  is  clear  that  (J^(*)}  solves  both  relations.  In  other 
words,  if  the  noise  process  _n(  • ,  •  )  is  spatially  and  temporally  white, 
simultaneous  diagonalization  of  R^(*,*)  and  R  (•>•)  is  straightforward. 

However,  making  a  white  noise  assumption  as  a  part  of  the  problem 
formulation  is  unrealistic.  Therefore,  it  is  necessary  to  establish 
the  conditions  when  simultaneous  diagonalization  is  possible  given  a 
colored  noise  process  n( • , • ) . 

Simultaneous  diagonalization  can  be  achieved  by  decorrelating  the 
Fourier  coefficients  of  n(*,*).  The  various  relations  among  the 
covariance  and  filter  operators  is  most  easily  seen  with  the  £ 2  repre¬ 
sentations  for  the  various  signals  and  operators  involved.  In  terms  of 


Fourier  series  representations ,  the  measurement  model 


(4.4-12).  The  modified  noise  covariance  kernel  becomes  an  identity 
operator;  however,  it  is  not  obvious  that  the  modified  signal  plus  noise 
and  signal  covariance  kernels  can  be  simultaneously  diagonalized.  Is 
this  still  possible? 

The  answer  to  this  question  is  yes.  We  have 


E{£'(aj)(r'(w))H}  =  E{C  r(a))( r%) )  CH}  =  C  R  CH  = 


C  Ry  CH  +  C  Rn  _CH  =  C  Ry  CH  +  I 


r’  =  r'  +  I 
-1  -y  - 


(4.4-13) 


(4.4-14) 


A  theorem  from  functional  analysis  [22]  states  that  a  necessary  and 
sufficient  condition  for  simultaneously  diagonalizing  two  linear  opera¬ 
tors  A( • )  and  B( • )  is  that 


AB( • )  =  BA( • ) 


R.’  =  R'  +  I  =  R'(I  +  r'  l) 
-1  -y  -  -y  -  -y 


r’(r'  +  i)  r'-1  =  r'(r')  r'"1 

— y  — y  —  —  y  —  y  —1  — y 


(4.4-15) 
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kev  Co  practical  implementation  as  well.  Uhv  do  we  make  these 
assertions? 

First,  evaluating  the  conditional  mean  estimate  from  array  measure¬ 
ments  is  simple.  This  can  be  seen  clearly  by  expanding  0(  •  ,  • )  in  terms 
of  Equation  (4.4-3): 


,(y) 


/  G(  t ,  u)  _r(u,  ui )  du  =  /  [£  — - — r.  (t  )  t*i^(u)  ]  r  (u ,  uj)  du 

T  T  k  XVr;  * 

k 


*<*> 

=  ^<r, 


,  (v) 


k  X 


(r)  -k  '  -k 


>  *.  (t)  = 


k  X 


(r) 


r  (u)  _£k(t ) 


The  coefficients  {r^(*)}  are  obtained  from  the  array  measurements  by  an 
inner  product  operation.  Calculating  the  conditional  mean  is  straight¬ 
forward  because  each  coefficient  is  multiplied  by  a  real  number.  This 
operation  is  analogous  to  postmultiplying  an  N  x  1  vector  of  coeffi¬ 
cients  by  an  N  x  N  diagonal  matrix  where  only  N  multiples  are  needed. 
The  preceding  remarks  suggest  this  structure  reduces  computational 
complexity. 


The  parameters  {X^1"^}  and  { ^ }  give  insight  into  the  channel 


operator  identification  problem,  and  suggest  a  method  of  obtaining 


(r). 


G(  •  ,  • )  adaptively.  Recall  that  the  { X^  "l  terms  are  the  variances  of 


the  expansion  coefficients  of  r(  •  ,  • )  given  Hp 

,(r) 


=  E{ | rk(w) |  } 


(4.5-1) 
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coe  f  fi cients : 


The  { X^  }  parameters  are  the  variances  of  the  signal  process 
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vector  of  Fourier  series  expansion  coefficients  of  the  probing  signal, 
If  the  Karhunen-Loeve  representation  of  v(*,*)  is  used,  then 


R  =  diagU^  X<?>  . 
—v  1  2 


(4.6-2) 


which  are  the  unavailable  parameters  in  the  estimator  structure.  The 
ith  element  in  Ry  is: 

E{  | yi (u> )  i 2}  =  =  I  I  ^in^^  s(m)  s*(n)  (4.6-3) 


This  shows  how  the  ^  terms  are  related  to  the  second  order  statis¬ 

tics  of  the  model  parameters.  Further  insight  is  gained  by  evaluating 
the  cross-correlations: 
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E{ l .  (u)  l*  («)>  =  E{<  £♦,♦.><£♦,*.  >*}  (4.6-4) 

lm  m  mi  ni 


In  terms  of  a  random  Green's  function  representation  for  £(•), 


E{<  £6  ,  4> .  >  <  £*,  4> .  >  > 

mi  n  l 

E{  [//  h(t,  t,  gj )  <t>  (t)  4>t(t)  dt  dr]  [ / /  h(u,  v,  w)  <J>  (v)  4>*(u)  du  dv]*} 
mi  n  l 

=  ////  E{h(t ,  i,  w)  h*(u ,  v,  w)}  4>*(t)  <j>.(u)  4>  (x)  4>*(v)  dt  du  dr  dv 

limn 

=  ////  G(  t ,  x,  u,  v)  $*(t)  $  (u)  <fr  (t)  *  ( v )  dt  du  dx  dv  (4.6-5) 
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The  expectation  of  the  product  h(t,  i,  u)  h*(u,  v,  u)  appears  quite 
often  in  the  stochastic  system  theory  literature,  and  is  important 
enough  to  warrant  a  name.  G( *,*,*,•)  is  called  the  stochastic  Green's 
function  of  kernel  h( •,*,*)  [30],  It  is  a  deterministic  integral  kernel 
which  relates  the  statistical  measures  of  the  system  output  to  those  of 
its  input.  In  this  context,  C( *,•,*,*  )  relates  the  second  order  statis¬ 
tics  of  y(*,')  to  the  properties  of  probing  signal  s(*). 

The  four-fold  integral  (4.6-5)  is  the  tensor  representation  of 
0(  *,*»*»*,)  with  respect  to  {$!<(•)}.  Therefore,  Equation  (4.6-5)  can 
be  interpreted  as  a  parameterization  of  G( *,*,',*)  with  respect  to  the 
Karhunen-Loe ve  basis  {<t>k(*)}. 

This  result  is  significant  for  several  reasons.  First,  it  nails 
down  the  meaning  of  the  somewhat  nebulous  expression,  "Identification  of 
the  channel  operator  £(•)."  The  cross-correlations  of  the  matrix 
representation  of  £(•)  can  be  interpreted  as  a  parameterization  of  the 
verv  important  stochastic  Green's  function  G( •,*,•,*) .  Estimating  the 
numerical  values  of  these  parameters  represents  a  systematic  approach  to 
stochastic  operator  identification.  Furthermore,  this  approach  allows  a 
priori  knowledge  of  the  statistical  properties  of  £(•)  to  be  incorpora¬ 
ted  into  the  estimator-correlator  structure.  For  example,  it  mav  N 
known  that  £(*)  belongs  to  a  certain  class  of  channels  character’.;-*  ■  • 
certain  forms  of  G( •,*,•,* )  or  their  mathematical  omii va ! *• -  . 
scattering  function  representations  (5).  ‘•'ore  oft:*"-  ■  *• 

functions  are  parameterized  bv  one  or  more  vari.i-  .<  - 
particular  channel  under  examination.  1* 
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can  be  determined  a  priori,  or  if  it  is  possible  to  bound  their  values 
based  on  environmental  constraints,  this  knowledge  can  be  incorporated 
into  the  detector  through  Equation  (4.6-5).  Unfortunately,  this 
interesting  problem  is  beyond  the  scope  of  this  study,  and  it  is  left 
as  an  issue  which  deserves  further  research. 

4.7  Conclusions 

The  solutions  for  _0( • , • )  and  _G( • , • )  were  derived  and  studied  in 
detail.  Spectral  representations  for  the  covariance  and  matrix  filter 
kernels  were  used  to  obtain  series  solutions  for  the  processor  equa¬ 
tions.  We  pointed  out  the  relationship  between  these  representations 
and  the  Karhunen-Loeve  expansion.  In  addition,  it  was  seen  that  this 
approach  sheds  light  onto  several  relevant  theoretical  issues  and  sug¬ 
gests  how  to  implement  the  optimal  processor. 

The  second  order  statistical  properties  of  £(•)  are  related  to  the 
{A<y)}  parameters.  In  principle,  the  channel  identification  process 
could  be  developed  based  on  this  relationship.  However,  this  method  has 
a  fundamental  limitation.  The  purpose  of  this  study  is  to  develop  a 
channel  identification  scheme  which  can  be  incorporated  into  a  practical 
array  processor.  Simplicity  is  of  the  essence  in  these  applications, 
and  clearly,  the  relationship  between  _Ry(’,‘)  and  £(•)  presented  in  the 
last  section  is  much  too  complicated  even  if  _Ry  is  diagonal.  A  more 
compact  representation  for  £(•)  is  required.  This  problem  will  be 
examined  in  the  next  chapter. 


Chapter  5 


A  CANONICAL  EXPANSION  APPROACH  TO  STOCHASTIC 
SYSTEM  MODELING  AND  IDENTIFICATION 


5.1  Introduction 

This  chapter  examines  the  modeling  and  identification  problem  in 
further  detail.  In  particular,  we  seek  a  more  compact  representation 
for  £(•)  in  order  to  simplify  Equation  (4.6-3).  Clearly,  simplifying 
this  expression  is  essential  if  the  estimator-correlator  is  to  be 
implemented.  Of  course,  the  requirements  which  were  spelled  out  at  the 
beginning  of  Chapter  4  must  still  be  met.  The  representation  for  £( • ) 
must  be  suitable  for  digital  signal  processing  applications.  Also, 
the  channel  identification  should  be  signal  independent.  Once  £( • )  is 
identified  using  a  probing  signal  s(*),  we  should  be  able  to  estimate 
the  statistical  measures  of  jr( • , • )  for  any  arbitrary  transmitted 
signal.  Finally,  the  representation  should  allow  a  priori  knowledge  of 
£(•)  to  be  incorporated  into  the  detector  structure. 

The  derivation  is  presented  in  the  next  section.  The  solution 
immediately  leads  to  a  convenient  series  representation  for  £(•) 
developed  in  Section  5.3.  These  results  are  new  and  represent  an 
original  contribution  to  stochastic  system  theory.  In  addition,  they 
provide  interesting  insights  into  several  classical  system  identifica¬ 
tion  theory  issues.  Using  this  representation,  we  obtain  a  very  simple 
expression  equivalent  to  Equation  (4.6-3).  It  establishes  an  interes¬ 
ting  connection  among  detection  theory,  estimation  theory,  and  sto¬ 
chastic  system  identification  theory  which  is  examined  in  Section  5.5. 


5.2  Derivation 


In  order  for  the  results  to  be  as  general  as  possible,  let  us  pose 
the  modeling  and  identification  problem  in  terms  of  a  random  Green's 
function  representation  for  £(•),  an  input  process  x( • , • ) ,  and  an  output 
process  y( • , • ) : 

y(t,  to)  =  £x(t,  to)  =  /  h(t,  t,  to)  x(t  ,  to)  dr  (5.2-1) 

T 

x(*,*)  is  a  stochastic  or  deterministic  probing  signal  whose  properties 
are  known  and  under  our  control.  The  final  goal  is  to  identify  statis¬ 
tical  measures  of  h( • , • , • )  in  terms  of  the  known  properties  of  x(*,*) 
and  the  measurable  properties  of  y(*,*).  We  shall  assume  zero  initial 
conditions,  which  is  reasonable  in  the  context  of  stochastic  trans¬ 
mission  channel  identification. 

Canonical  expansions  of  the  input  and  output  covariance  kernels 
will  be  used  to  identify  the  stochastic  system.  Let  { 4»lc ( * ) )  be  an 
orthonormal  basis  simultaneously  solving  the  following  expressions: 

XkY)  V0  =  /  Ry(t’  tl)  W  dtl  (5.2-2) 

Xk°  *k(t)  “  /  Rx(t*  t2)  ♦k(t2)  dC2  (5.2-3) 

T 

In  general,  it  is  difficult  to  find  simultaneous  solutions  to  these 
eigenvalue  equations.  One  approach  is  based  on  a  generalized  eigenvalue 
decomposition  of  Ry(*,*)  and  Rx(*,*)  which  will  be  described  in  detail 
later  In  the  dissertation.  There  are  numerically  robust  algorithms 


available  for  solving  the  factorization  problem.  Furthermore,  in  a 
number  of  meaningful  special  cases,  simultaneous  solutions  to  Equations 
(5.2-2)  and  (5.2-3)  exist.  For  example,  if  the  input  process  x( • , • )  is 
wide  sense  stationary,  and  the  random  Green's  function  h( *,*,•)  is 
time-invariant,  then  complex  exponentials  satisfy  the  equations.  In 
addition,  if  either  the  input  or  output  process  can  be  approximated  by  a 
white  noise  process,  then  the  problem  reduces  to  solving  only  one  of  the 
eigenvalue  equations.  This  approximation  is  reasonable  since  the 
probing  signal  s(*)  is  under  our  control.  Good  probing  signal  design 
can  be  exploited  to  simplify  the  representation  and  identification  of 
£(•). 

Next,  recall  that  the  system  output  y( • , • )  is  given  by 

y(t,  cd)  =  /  h(t,  r,  ui)  x(t,  td)  dr 
T 

We  shall  make  the  physically  realistic  assumption  that  h( •,*,•)  and 
x(  • ,  • )  are  uncorrelated.  In  terms  of  Equation  (5.2-1),  the  system 
output  covariance  kernel  Ry(*,*)  is: 

R  (t  ,  t  )  =•  E{[  /  h(t  ,  t,  id)  x(t,  cd)  dr  ]  [  /  h(t?,  s,  u>)  x(s,  id)  ds  ]  * } 
v  1  ^  T  1  t  ^ 

Writing  Ry(*,«)  in  terms  of  expectations  over  the  individual  ensembles 


[«I>; 


e{x(t,  to)  x*(s,  to)}  “  RX^T»  s)  =  l  X£x'  4>k( T )  $k(s) 

k*l 


and,  of  course, 


Ry(tl’  t2)  ■  J  XkY)  *k(tl*  *k(t2} 
k*l 


There  fore. 


W  t2)  *ky>  W  <(t2> 


Eh{  I  Ak  I  h(tl>  T»  <frk(t)  dr  f  h*(t2,  s,  to)  <ji*(s)  ds} 

k=1  T  T  (5.2-4) 


Equating  the  kth  terms  in  the  expansion  yields: 


/  h(t  j ,  t,  to)  <f»k(T)  dx  *  hk(u>)  <(>k( t  j ) 


(5.2-5) 


/  h*(t,,  s,  to)  4>*(s)  ds  -  h*(to) 


Notice  that  the  two  preceding  relationships  are  eigenvalue  equations 
for  h( *,*,•),  which  implies  that  in  essence,  a  Karhunen-Loeve  expansion 
of  h( •,♦,*)  is  being  performed.  Substituting  these  relationships  into 
Equation  (5.2-4)  and  equating  the  k-th  expansion  coefficients  gives: 
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,  ^  Eh  <\<“>  <<“»  W 

^  X<x)  E{  |hk(03)  1 2}  ^(tj)  *J(t2)  (5.2-6) 

( y ) 

he  identification  problem.  The  eigenvalues  (X^  }  are 

itimated  from  the  output  measurements  y(‘,*).  Further- 
ues  {X^  }  are  known  because  the  process  x( • , • )  is 

Therefore : 

E{|hk(u>)i2}  =  X<h)  =  X<y)/X<x)  (5.2-7) 


Vu) 


/  /  h(t,  t,  u)  4>*(t)  4> . ( x >  dt  dx 


(5.3-2) 


and  the  functions 


t)  =  <J>±( t )  <J»*( t) 


form  an  orthonormal  basis  for  the  product  space  T  x  T.  The  coefficients 
{hij(*)}  are  the  matrix  representation  of  the  random  Green's  function 
h( •,*,•),  or  from  a  more  abstract  point  of  view,  the  matrix  representa¬ 
tion  of  stochastic  operator  £(•)• 

From  Equation  (5.2-5): 


/  h(t ,  T,  w)  <j>.(t)  dr  =  h  (w)  <)>.(t) 


which  implies  that 


hij(w)  =  /  hj(w)  4>j(t)  4>*(t)  dt  =  hj(w) 


(5.3-3) 


This  result  shows  that  the  matrix  representation  of  £(•)  is  diagonalized 
provided  the  covariance  functions  of  x(*,*)  and  y( • , • )  are  simultan¬ 
eously  diagonalizable .  Therefore,  the  random  Green's  function  h( • , * , • ) 
can  be  represented  by  a  single  sum  series: 


h(t,  T,  u>)  -  l  h  (w)  $  (t)  4>*(t) 


(5.3-4) 
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The  second  scheme  to  be  considered  is  based  on  the  representations 
for  £(*,•)  and  £(  • ,  • )  derived  in  Section  3.4,  where 


—  M 

xU.  to)  =  l  l  x  .  ( ui )  4>  (t)  e 
i=l  j=l  1J  J 


(5.4-5) 


00  N 


^(t,  to)  =  l  l  y.  .(to)  .  ( t )  £ 

i=l  j=l  1J  J 


(5.4-6) 


Assuming  the  { y l j ( * ) )  coefficients  are  uncorrelated,  then 


00  n 


R  (t,  to)  -  l  l  A^  4<i ( t )  4>*(u)  £  e* 
y  i=l  j=l  J  J  J 


(5.4-7) 


But  for  a  fixed  i, 


I  ‘Jfi,  1,  ■  »U>  A.<lr>  UH(1) 

J-l  3  J  J 


(5.4-8) 


where  the  sun  is  written  in  matrix  notation: 


iKi)  =  [£i  ...  £JJ ] 


(5.4-9) 


,(y)  _  ..  „f,(y)  ,  (y), 

diagfA^  ...  A^^  ] 


(5.4-10) 


Substituting  (5.4-8)  into  (5.4-7)  leads  to  the  following  simplification 


R  (t,  u)  =  l  $  (t)  U(i)  A(y)  UH(i)  **(u) 
>  i-1 


(5.4-11) 


which  will  be  used  in  the  subsequent  derivation. 


4>k<c )  U(k)  A^y)  UH(k)  4>*(u)  =  <j,k(t)  U(k)  A.[h)  A_£x)UH(k)  *J(u) 


for  all  k,  and 


.(h)  (y)  A(x)-1 

4  4  4 


(5.4-16) 


Each  matrix  in  Equation  (5.4-16)  is  diagonal;  therefore,  the  identifi¬ 
cation  is  easy  to  perform.  Moreover,  the  representation  of  H( *,",*) 
with  respect  to  (£jj ( * )}  reduces  to: 


oo  n 


H(t,  r,  oi)  =  l  l  h  (<o)  (t)  <p  (t)  £  £ 

i=l  j=l  3  3 


(5.4-17) 


which  is  the  matrix  equivalent  of  (5.3-4). 


5.5  Discussion 


The  results  from  Sections  5.2  through  5.4  show  how  Karhunen-Lceve 
expansions  can  be  applied  to  the  system  modeling  and  identification 
problem.  Expanding  the  random  Green's  function  in  terms  of  the 
Karhunen-Loeve  basis  is  a  fundamental  approach  to  stochastic  system 
modeling.  It  simplifies  the  identification  problem,  and  gives  a 
representation  for  £(•)  which  can  easily  be  incorporated  into  the 
estimator-correlator  structure.  One  can  interpret  the  results  as  a 
transformation  of  the  matrix  parameters  from  an  arbitrary  basis  into 
the  Karhunen-Loeve  basis,  where  the  identification  is  easier  to  perform. 
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The  results  provide  several  interesting  new  insights  into  the 
stochastic  system  identification  problem.  Applying  Equation  (5.2-7)  to 
the  original  problem  of  simplifying  Equation  (4.6-3)  yields 

Xly)  =  E0\(“)!2}  4S)  (5.5-1) 

( s ) 

where  {A^  }  are  the  eigenvalues  of  _P.  This  result  is  deceptively 

simple  looking;  actually,  it  ties  together  ideas  from  several 
disciplines.  The  representation  for  £(•)  was  derived  in  order  to 
simplify  the  relationship  between  the  {A^y^}  terms  and  the  { S ^ }  terms, 

K  iC 

in  other  words,  to  simplify  the  channel  identification  problem.  The 
{A^7  }  terms  are  needed  to  calculate  the  minimum  mean-square  estimate  of 
the  channel  output  used  to  compute  the  likelihood  ratio.  Of  course,  the 
key  to  obtaining  (5.5-1)  is  simultaneous  diagonalization  of  _P  and  Ry, 
which  in  this  context  depends  on  proper  design  of  the  probing  signal 
s(-),  or  in  the  case  of  more  general  signals,  the  use  of  numerically 
robust  algorithms  solving  a  generalized  eigenvalue  problem.  The  repre¬ 
sentation  for  £(•)  meets  the  requirements  defined  at  the  beginning  of 
the  chapter.  It  is  well-suited  for  digital  processing  of  the  array 
data,  and  it  allows  a  priori  knowledge  of  the  scattering  channel  to  be 
incorporated  irto  the  estimator-correlator  structure. 

Equation  (5.5-1)  also  gives  new  insight  into  the  meaning  of  the 
phrase,  "For  system  identification,  a  probing  signal  must  be  suffici¬ 
ently  rich  [37]  [38].“  In  terms  of  Equation  (5.5-1),  it  means  that  for 

( y )  C  s ) 

each  eigenvalue  A ^  .  the  probing  signal  eigenvalue  A^  must  be  large 

enough  so  that  numerical  errors  do  not  occur  while  performing  the 


Chapter  6 


NUMERICAL  ISSUES 

6. 1  Introduction 

At  this  point  in  the  dissertation,  the  mathematical  derivations 
needed  to  solve  for  the  processor  kernels  have  been  completed.  We  have 
asserted  that  orthonormal  representations  of  the  covariance  kernels,  the 
processor  kernels,  and  stochastic  operator  £(•)  can  be  used  to  obtain 
the  solutions.  However,  several  outstanding  issues  still  must  be 
resolved  before  the  optimal  structure  can  actually  be  implemented.  In 
particular,  we  have  yet  to  show  how  the  Karhunen-Loeve  eigenfunctions 
and  eigenvalues  can  be  calculated  from  array  measurements.  This  problem 
must  be  examined  in  detail  if  we  are  to  go  beyond  formal  manipulations 
pf  infinite  series  to  a  working  system.  In  conjunction  with  this  issue, 
the  numerical  difficulties  inherent  in  any  adaptive  signal  processing 
system  must  be  overcome.  How  do  numerical  difficulties  arise? 

The  answer  to  this  question  relates  to  the  nature  of  adaptive 
systems.  Recall  that  an  adaptive  system  is  a  learning  or  self- 
optimizing  machine  which  adjusts  its  response  according  to  the  statis¬ 
tical  properties  of  its  surroundings  [40] .  This  is  where  numerical 
difficulties  can  occur.  For  example,  second  order  statistical  informa¬ 
tion  is  usually  estimated  by  post-multiplying  a  data  matrix  by  its 
Hermitian  transpose,  which  causes  loss  of  numerical  precision  when  the 
arithmetic  operations  are  carried  out  on  finite-precision  hardware.  The 
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ability  to  implement  an  equivalent  adaptive  processor  in  hardware 
without  performing  the  squaring  operation  is  very  desirable. 

Since  a  priori  knowledge  of  the  covariance  kernels  is  incomplete, 
they  must  be  estimated  from  array  data.  Therefore,  the  numerical 
problems  as  described  above  must  be  taken  into  account  while  designing 
the  processor. 

The  purpose  of  this  chapter  is  to  solve  the  computational  and 
adaptive  implementation  problems  required  to  construct  the  optimal  pro¬ 
cessor.  We  begin  by  returning  to  the  inverse  noise  covariance  kernel 
Q(  • ,  • ) ,  and  suggest  an  approach  for  computing  the  Karhunen-Loeve  basis. 
The  solution  requires  calculating  matrix  products,  an  operation  which 
should  be  avoided  whenever  possible.  The  singular  value  decomposition 
can  be  used  to  solve  an  equivalent  estimation  problem,  bypassing  the 
squaring  step  altogether. 

Next,  we  shall  turn  our  attention  to  the  estimator  branch  _G(  •,*) . 
The  solution  is  based  on  simultaneous  diagonalization  of^y(‘,')  and 

using  generalized  eigenvalue  decomposition.  Once  more,  a  matrix 
squaring  operation  appears  in  the  solution  formulation.  Can  an  equiv¬ 
alent  processing  system  be  realized  without  squaring? 

The  answer  to  this  important  question  is  yes,  and  the  processing  is 
based  on  a  matrix  decomposition  which  is  just  now  appearing  in  the 
numerical  signal  processing  literature  [41].  It  is  called  the  CS 
(cosine-sine)  decomposition,  and  it  provides  the  means  to  compute  _G( • , • ) 
directly  from  array  data,  making  the  matrix  multiplication  unnecessary. 
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Furthermore,  it  will  be  proved  that  this  solution  is  numerically 
equivalent  to  decorrelating  the  expansion  coefficients  of  n(*,*)  fol¬ 
lowed  by  a  Karhunen-Loeve  transformation. 

Finally,  we  will  show  how  to  obtain  the  data  which  are  needed  to 
estimate  Rjj(  • ,  • ) ,  Ri(*,*),  and  _Ry(*,*).  It  will  be  seen  that  _Ry(*,*) 
must  be  obtained  at  a  high  signal-to-noise  ratio,  as  this  step 
represents  system  identification. 

6.2  The  Inverse  Noise  Covariance  Kernel 
6.2.1  Example 

The  computational  issues  involved  with  implementation  can  be 
illustrated  by  considering  the  following  example.  Suppose  that  _n ( • , • ) 
is  a  wide-sense  stationary,  periodic  stochastic  process.  Find  the 
Karhunen-Loeve  basis  and  Q( • , • ) . 

We  begin  by  selecting  a  basis  in  the  form  derived  in  Section  3.4: 

=  ^(t)  (6.2-1) 

for  j  =  1,2...,  M,  where  M  is  the  number  of  sensors,  and  for  all  i. 

_n ( • ,  • )  can  be  expanded  as  follows: 

00  N 

n(t,u))  =  l  l  n  (w)  (fi  (t)  u 

i—  j  =  l  1J  J 

and  Q( • , • )  has  the  representation: 
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The  basis  elements  must  be  orthogonal: 


<5,,  6.. 

ik  ji 


and  the  expansion  coefficients  must  be  uncorrelated 


For  a  given  m,  Equation  (6.2-4)  becomes: 


A  u  =  /  Rj..(t)  e  -'mT  u  dr  =  S..(mwn)  u 

mn— n  1  — N  — n  — N  0  — n 

—<o 

where  J5{j(  • )  is  the  power  spectral  density  matrix  of  _n(  • ,  • ) ,  and  wg  is 
equal  to  one.  Therefore,  the  equation 


\  u 
mn-m 


(m)u 
— n 


(6.2-5) 


must  be  solved  for  eigenvalues  {Amn}  and  eigenvectors  {u^}  to  obtain 
the  Karhunen-Loeve  basis.  Actually,  since  the  eigenvectors  {u^}  are 
also  a  function  of  m,  Equation  (6.2-5)  should  be: 


A  u 
mn  — nn 


(m)  u 
— mn 


(6.2-6) 


Solving  Equation  (6.2-6)  provides  the  Karhunen-Loeve  basis.  Of  course, 
the  wide-sense  stationary,  periodic  signal  assumptions  were  only  made 
for  purposes  of  illustration.  The  preceding  results  can  be  general¬ 
ized  to  nonstationary  processes  as  well. 


6.2.2  A  Possible  Processing  Scheme 

The  results  from  Section  6.2.1  suggest  a  processing  scheme  illus¬ 
trated  in  Figure  6-1.  Since  the  scalar  basis  is  the  set  of  complex 
exponentials,  the  first  step  is  to  compute  the  discrete  Fourier  trans¬ 
form  of  the  array  data  given  noise  alone.  Next,  the  power  spectral 
density  matrix  at  each  harmonic  mug  is  estimated  by  averaging  over  the 
array  data.  Finally,  the  vector  basis  functions  are  calculated  bv 
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performing  an  eigenvalue  decomposition  of  each  estimated  power  spectral 
density  matrix. 

While  this  sequence  of  steps  is  mathematically  correct,  it  is  not 
advisable  for  the  reasons  given  in  Section  6.1  [42],  The  power  spectral 
density  matrix  estimation  involves  a  matrix  squaring  operation  prior  to 
the  eigenvalue  decomposition  step.  Since  this  reduces  the  precision  of 
the  final  answer,  an  equivalent  procedure  avoiding  this  step  is  needed. 

This  is  where  the  singular  value  decomposition  shall  be  introduced. 

6.2.3  Applying  the  Singular  Value  Decomposition 

The  loss  of  numerical  precision  can  be  avoided  if  the  singular 
value  decomposition  (SVD)  [39]  is  used  instead  of  eigenvalue  decomposi¬ 
tion.  The  first  step  in  an  equivalent  processing  operation  is  to 
calculate  the  discrete  Fourier  transform  as  before.  Next,  consider  a 
sequence  of  vectors  fr(k)}.  Arrange  them  sequentially  in  the  N  x  L 
matrix  A(k) : 

A(k)  =  [_r j C k )  £20O  ...  £L(k)]  (6.2-7) 

where  £i(k)  is  the  i-th  M  x  1  vector,  and  L  is  the  number  of  measure¬ 
ments.  The  notation  is  simplified  by  dropping  the  notation  k.  Then  A 
has  the  following  decomposition: 

A  *  D  E  VH  (6.2-8) 

where  JJ  is  an  M  x  M  unitary  matrix,  £  is  an  L  x  L  unitary  matrix,  and  £ 
is  a  diagonal  matrix  whose  elements  are  the  singular  values  of  A.  A  can 
be  written  in  terms  of  the  singular  values  { }  and  the  columns  of  U  and 
V: 
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.  v  H 

-  =  >,  ai 

i=l 


(6.2-9) 


r  is  the  number  of  non-zero  singular  values  of  A,  which  is  also  the  rank 
of  A.  Furthermore,  the  following  relationships  between  the  columns  of 
U,  columns  of  V,  and  A  are  met: 


A  Am 


°ra  um 


A^  Hm  =  am  Am 


(6.2-10) 

(6.2-11) 


An  eigenvector-eigenvalue  relationship  is  found  by  eliminating  ^  from 
Equations  (6.2-10)  and  (6.2-11): 


A  AH  u  =  a2  u  (6.2-12) 

- — m  m  — m 


H  2 

A  A  is  a  Hermitian  matrix,  and  0^  =  \  . 

-  ram 

The  point  of  this  discussion  is  that  the  eigenvalue  decomposition 

can  be  calculated  by  solving  an  equivalent  singular  value  decomposition. 

The  power  spectral  density  matrix  estimates  are  given  in  the  form 


A(k)  AH(k) 


(6.2-13) 


at  each  frequency  k.  It  is  clear  from  (6.2-10)  and  (6.2-12)  that  the 
eigenvectors  of  this  product  are  the  same  as  the  left  singular  vectors 
of  A(k).  Therefore,  the  eigenvectors  and  eigenvalues  of  (6.2-13)  can  be 
obtained  directly  from  transformed  array  data  by  calculating  a  singular 
value  decomposition.  The  procedure  is  illustrated  in  Figure  6-2. 
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6.3.2  Basic  Numerical  Approach 


In  Section  4.4  we  demonstrated  how  the  solution  for  G( • , • )  was 
constructed  by  data  prewhitening  followed  by  an  eigenvalue  decomposi¬ 
tion  of  the  modified  covariance  kernels  R.' (*,•)  and  R  *  ( *  ,•).  Here,  the 

—1  — y 

problem  will  be  approached  from  a  different  perspective.  What  we  seek 
is  a  linear  transformation  which  diagonalizes  _R  ^  ( * » * )  and  _Ry(*,*) 
directly,  skipping  the  prewhitening  step  entirely.  In  terms  of  matrix 
representations,  this  problem  can  be  posed  as  follows.  Find  a  linear 
transformation  X  such  that  _Rj  and  _Ry  are  diagonalized  simultaneously 


_X  =  diagtaj  c*2  ...  ] 

— H  JLy  ii  =  diag  ( 8 1  $2  •  •  •  ) 


(6.3-2) 


(6.3-3) 


Provided  a  non-singular  X  exists,  then: 


R.1  =  X“H  Aa  if1 


where  the  -H  notation  denotes  inversion  followed  by  Hermitian  transpose, 
and  similarly, 


Ry  =  Fh  Ab  2Tl 


Since  G  is  given  by: 


G  =  R  r'1 
-  -y  -1 


in  terms  of  X.  _A0,  and  _Ag,  is  written: 
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G  =  R  rT1  =  X~H  h  X_1  X  A  1  XH  =  X  H  A„  A-1  XH 

—  — y  —  1  —  -HI  —  —  — a  —  —  -d5  — a  — 


X  H  diajslBj/aj  ...  ]  XH 


(6.3-4) 


The  necessary  and  sufficient  conditions  for  simultaneous  diagonali- 


zation  of  two  positive  definite,  Hernitian  matrices  are  well  under¬ 


stood  [43].  It  can  be  shown  that  there  exists  a  non-singular,  unique 


matrix  _X  which  diagonalizes  _Rj  and  _Ry.  Furthermore,  the  computational 


aspects  of  this  problem  have  been  studied  extensively  by  numerical 


analysts.  In  this  field,  the  problem  is  called  the  generalized  eigen¬ 


value  problem  of  the  matrix  pencil  J*1  “  A  Ry  [39],  The  fact  that  the 


columns  of  _X  correspond  to  the  generalized  eigenvectors  of  Rj  and  JRy 


can  be  demonstrated  by  rewriting  Equations  (6.3-2)  and  (6.3-3): 


Rl  X  =  X-H  diagfaj  a2  •••  1 


Ry  X  =  X“H  diag [8 1  S2  ...  1 


and  by  defining  E  =  X 


_Rl  L  ~  J-  diaglaj  <*2  •••  ) 


Ry  _X  =  _E  diag[3i  32  •••  1 


For  the  kth  columns  of  X  and  E: 
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J*y  ilk  “  ®k  £k 
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Rearranging  these  relations  leads  to  the  following: 

Jil  21k  =  ( ak  /  $k )  Ry  21k  =  ^k  Hk 

which  shows  that  the  rows  of  X  are  the  generalized  eigenvectors  of 
and  Ry,  and  that  the  {a^/Bk.}  terms  are  the  generalized  eigenvalues  of  Rj 
and  _Ry. 

The  preceding  discussion  suggests  that  G( • , • )  can  be  calculated 
directly  by  solving  a  generalized  eigenvector  problem,  where  _G(  • ,  • )  is 
expressed  directly  in  terms  of  a  generalized  spectral  representation. 

But  actually,  this  solution  is  equivalent  to  that  presented  in  Section 
4.4.  This  can  be  seen  by  defining  a  filter  operator  C  as  the  matrix 
performing  Gram-Schmidt  orthogonalization  of  the  noise  process  Fourier 
coe  f ficients : 


n  '(at)  =  C  n(w) 


(6.3-5) 


where 


E{n'(u)  nfH(u)}  =  I 


C  can  be  written  in  upper  triangular  form,  and  given  a  positive 
definite,  Hermitian  noise  covariance  kernel  Rjj,  C  is  unique  [44], 
Now,  when  the  array  data  _r ( •  )  are  prewhitened  by  _C: 


r'(u)  =  C  r(w)  =  C  v(w)  +  C  n(u)  =  y'(oj)  +  n'(w) 
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the  covariance  kernels  of  r ' ( ■ )  and  y'(*)  are: 


R|  =  E{£'(oj)  _r'H(to)}  =  C  CH  =  C  CH  +  I  =  +  J. 

Both  R*  and  r'  are  positive  definite  and  Hermitian,  their  eigenvectors 
—1  — y 

are  the  same,  and  they  are  unique: 


Rj  =  U  Aj  U 


R  =  U  A  U 
— y  — y  — 


(6.3-6) 


(6.3-7) 


Rearranging  Equations  (6.3-6)  and  (6.3-7)  yields: 


UH  Rj  U  =  diaglA^'*  ...  ] 


UH  R '  U  =  diag [A^y  ^  . , 
—  —v  —  1 


but  in  terns  of  _Ri ,  R.y,  and  _C: 


UH  C  Ri  CH  II  =  diag[ A ^ 


UH  C  R  CH  U  =  diag[A^y  5 
- - y  -  -  1 


The  matrices  U  and  C  are  unique,  therefore: 
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XH  =  UH  C  =  0  R 


(6.3-8) 


which  equates  the  eigenvectors  of  R '  and  r'  with  the  orthonormal 

—  1  — y 

columns  _0  of  the  QR  factorization  of  _X  ,  and  the  whitening  filter  C 
with  matrix  _R.  The  {a^}  and  {8^}  terms  are  the  eigenvalues  of  _Rj  and 
R'  respectively.  Therefore,  solving  the  generalized  eigenvalue  problem 

— y 

is  equivalent  to  decorrelating  _n(  • )  followed  by  a  Karhunen-Loeve  trans¬ 
formation. 

6.3.3  Implementing  the  Estimator  pranch  with  the  CS  Decomposition 

We  have  seen  how  to  implement  the  estimator  branch  using  the  gener¬ 
alized  eigenvalue  decomposition,  and  in  principle,  this  part  of  the  pro¬ 
blem  is  solved.  In  this  section,  we  consider  how  implementation  can  be 
accomplished  in  a  numerically  robust  fashion. 

Maximum  likelihood  estimates  for  _R^  and  jly  can  be  constructed  by 
averaging  over  independent  array  measurements.  The  procedure  begins  by 
calculating  the  appropriate  Fourier  series  expansion  coefficients: 

ri(io)  =  <  £,  £1  > 

rk(u>)  =  <  £,  > 

and  so  on.  This  step  is  repeated  over  L  measurements,  and  a  data  matrix 
A  is  constructed: 

AH  =  [jr  i  £2  •  •  •  £k  •  •  •  I 

The  kth  column  of  contains  the  L  coefficients  calculated  with  respect 
to  the  kth  basis  function.  Finally,  the  estimated  covariance  matrix  _R 
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-  H 

R  =  A  A 


(6.3-9) 


The  signal  alone  covariance  kernel  must  also  be  estimated  from 
array  measurements.  Of  course,  the  signal  £ s_(  • )  is  always  obscured  by 
additive  measurement  noise,  meaning  that  it  is  not  possible  to  estimate 
_Ry  directly  from  data.  However,  if  the  signal-to-noise  ratio  is  large 
enough,  the  array  measurements  can  be  made  close  to  signal  alone: 

_r(to)  =  ^(oj)  +  _n(w)  a 

Since  the  probing  signal  is  under  our  control,  high  signal-to-noise 
ratio  measurements  can  be  obtained  by  adjusting  the  probing  signal 
energy.  By  repeating  the  processing  steps  described  above,  a  data 
matrix  B  which  approximates  signal  alone  measurements  is  formed: 


bH 
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and  _Ry  is  estimated  by 


«  H 
R  =  B  B 

-y  -  - 


(6.3-10) 


Finally,  the  generalized  eigenvectors  and  eigenvalues  can  be  calculated 
by  solving  the  system 


AH  A  xr  =  *k  2Sk 


H 


(6.3-11) 


for  its  eigenvalues  and  eigenvectors. 

In  order  to  solve  Equation  (6.3-11)  without  squaring,  we  propose  a 
new  approach  which  is  just  now  becoming  known  in  the  signal  processing 
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community  [41]  [45].  The  solution  uses  the  CS  decomposition,  which 
will  be  described  in  the  next  section. 

6.3.4  Solution 

The  CS  (cosine-sine)  matrix  decomposition  arises  naturally  in  the 
context  of  the  A**  A  -  X  _BH  II  generalized  eigenvalue  problem.  This 
system  can  be  solved  by  CS  decomposition  directly  in  terms  of  A  and  _B. 
Suppose 
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(6.3-12) 


is  the  OR  decomposition  of  A  over  JR.  Writing  _0i  and  J)2  in  terms  of 


their  singular  value  decompositions  gives 

si  =  ja  C  V 
2.2  =  S  V 


(6.3-13) 


(6.3-14) 


where 


C  =  diag(ci) 


and 


S  =  diag(si) 

The  {o^}  and  (£i)  terms  are  non-negative,  Uj ,  U2*  atld  V  are  unitary,  and 

C2  +  s2  =  I 


v- 

V.VA-V, 

• _ = 

,  • 


.  >> 

,4*:^ 

^.•w- 

>  •/ ,*■  _* 
<y- .*■  -V 

VvV'  .• 

^•Aw 


.■* -v . 

.  •  .  «  *  1 *r  *  * 

-  v.v>.v 
■  v./.v 


By  setting 


l 


then 


XH  (A.H  _A)  _X  =  diagfcq  ...  a^j) 
XH  (bH  B)  X  =  diag[Si  ...  BN] 


and  it  follows  that  _X  is  the  matrix  simultaneously  diagonalizing  A^  A 
and  J3h  _B.  Therefore,  using  this  approach,  the  estimates  for  X,  the 
generalized  eigenvalues,  and  £  can  be  calculated  directly  from  array 
data  without  performing  squaring. 

The  preceding  discussion  leads  to  the  following  processing  steps 
needed  to  calculate  G:  Form  data  matrices  A  and  B  based  on  appropriate 
Fourier  series  representations,  such  as  the  discrete  Fourier  transform. 
Next,  calculate  the  QR  decomposition  of  the  _A  over  _B  matrix.  Third, 
calculate  the  singular  value  decompositions  of  Oj  and  O2.  Fourth,  form 
X  from  the  _V  and  _R  matrices  obtained  in  steps  three  and  one  respec¬ 
tively.  Finally, 


G  =  X~H [ STS ] [CTC]-1  XH 


(6.3-18) 
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6.4  Adaptive  Covariance  Kernel  Estimation 

Several  details  involved  with  covariance  kernel  estimation  must  be 
worked  out.  Since  these  estimates  depend  on  array  data,  it  is  clear 
that  the  proper  information  must  be  used  to  construct  the  estimates. 

For  example,  R^j(  •  ,  • )  must  be  constructed  from  measurements  Riven  no 
signal  present.  Similarly,  both  signal  and  noise  should  be  present  in 
the  measurements  used  to  generate  data  matrix  A.  However,  these  steps 
present  a  dilemma,  since  the  processor  is  designed  to  perform  detec¬ 
tion.  How  can  the  covariance  kernel  estimates  be  constructed  without  a 
priori  knowledge  of  the  correct  hypothesis? 

The  answer  to  this  question  relates  to  the  fundamental  nature  of 
adaptive  processing  systems.  Building  the  covariance  kernel  estimates 
is  a  learning  step  in  which  a  priori  knowledge  of  the  underlying  data 
structure  must  be  known.  Therefore,  in  order  to  determine  Rjj(*,*)  and 
Rq (*,*),  we  will  assume  that  a  priori  knowledge  of  the  correct 
hypothesis  is  available. 

In  the  active  detection  problem,  one  can  use  the  scheme  in 
Figure  6-3  [4].  Here  T'  is  the  return  travel  time  of  the  probing  sig¬ 
nal,  and  T  is  once  again  the  observation  interval.  The  noise  covariance 
kernel  estimate  can  be  formulated  during  the  interval  T',  and  used  in 
subsequent  detection.  The  signal  plus  noise  kernel  is  estimated  over 
the  tine  interval  T. 

Calculating  the  signal  alone  kernel  is  not  quite  as  straight¬ 
forward,  since  signal  alone  measurements  are  not  available.  However, 
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the  probing  signal  s(*)  is  under  our  control,  and  if  its  energy  is  large 
enough,  we  can  argue  that: 

jr(t,  w)  =  £s_(t)  +  _n(  t ,  to)  *  £ s_(  t )  (6.A-1) 

and  by  proper  scaling  of  the  data  measurements ,  the  kernel  _Ry(*,') 
corresponding  to  the  data  measured  over  the  time  interval  T  can  be  esti¬ 
mated.  Clearly,  a  high  signal-to-noise  ratio  is  needed  during  this 
step.  This  is  reasonable  from  a  system  identification  point  of  view, 
since  _Ry ( • , • )  represents  the  second  order  statistics  of  an  unknown 
linear  system.  System  identification  is  fundamentally  a  high  signal-to- 
noise  ratio  process.  Of  course,  once  the  data  matrix  B  is  formed  at  a 
high  signal-to-noise  ratio,  the  solution  can  be  incorporated  into  the 
detection  process,  which  is  generally  carried  out  in  low  signal-to-noise 
situations. 

b.  5  Conclusions 

We  have  examined  the  adaptive  implementation  issues  in  some  detail 
and  have  proposed  processing  schemes  based  on  robust,  state-of-the-art 
numerical  algorithms.  Our  results  have  established  interesting  con¬ 
nections  between  several  canonical  matrix  decompositions  and  the  spec¬ 
tral  representations  of  _Q( • , • )  and  _G(  • , • )  presented  in  Chapter  A. 

It  is  clear  that  the  singular  value  decomposition  is  a  verv  useful 
numerical  processing  tool.  The  eigenvectors  and  eigenvalues  of  the 
estimated  power  spectral  density  matrices  were  obtained  directly  from 
data  matrices  without  performing  a  squaring  operation.  In  another  con¬ 
text,  this  decomposition  could  be  used  to  estimate  covariance  matrix 
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eigenvectors  from  array  measurements.  The  advantages  of  this  approach 
are  well  documented  [46]. 

We  proposed  using  the  CS  decomposition  to  solve  for  G(  •  ,  • )  numer¬ 
ically.  This  method  is  equivalent  to  noise  coefficient  decorrelation 
followed  by  a  Karhunen-Loeve  transformation.  Furthermore,  the  CS 
decomposition  solves  for  the  generalized  eigenvectors  and  eigenvectors 
of  and  _Ry ( • , • )  directly  from  data  matrices,  once  again  making 

matrix  multiplication  unnecessary. 

Finally,  we  showed  how  to  adaptively  estimate  the  covariance 
kernels  from  array  measurements.  This  part  of  the  problem  must  be 
interpreted  as  a  learning  procedure  performed  prior  to  detection. 

Ry( • , • )  must  be  obtained  at  a  high  signal-to-noise  ratio,  since  this 
represents  a  system  identification  procedure. 


ident 


The  covariance  matrix  of  y( • , • )  at  time  t  is: 


Ry  =  {[  l  K  lks(t'>}  i  I  b»  lps(c)]Hl 


=  E{  l  l  vk  vj  |s(t)|2}  =  l  l  vk  E{bkb*}  vj  |s(t) 


k=l  1=1 


k=l  £=1 


=  a2  V 

s  — 


where 


V  =  v  v  ...  v  ] 

-  -1  -2  -p 


(7.2-4) 


is  a  Vandermonde  matrix  whose  columns  are  the  steering  vectors  of  the 

2 

arriving  planar  wavefronts,  ag  is  the  signal  power,  and  R^  is  the 

correlation  matrix  of  the  { b^_  ( * ) }  coefficients.  Of  course,  the  wave- 

fronts  are  uncorrelated  if  E{ b  b*}  is  identically  zero  when  k  *  l. 

k  x. 

The  noise  covariance  matrix  is 


R,,  =  I  +  a“  V  R  V 
— N  w  —  N  — N  —BN  — N 


(7.2-5) 


where  a  is  the  sensor  noise  variance,  V„,  is  a  matrix  whose  columns  are 
w  — N 

the  steering  vectors  {_v  }  ,  and  R^  is  the  covariance  matrix  of  the  path 
coe  f  f icients  . 

Finally,  the  signal  plus  noise  covariance  matrix  Rj  is  the  sum  of 
_R_\  and  Rv ,  because  we  have  assumed  uncorrelated  signal  and  noise  compo¬ 
nents. 
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The  optimal  receiver  for  a  snapshot  of  data  is 


A(r)  =  Ar”1  -  R'Sr 


(7.2-6) 


and  the  estimator-correlator  form  of  A(*)  is 


A(£>  =  £)H  *v  V  £ 


(7.2-7) 


This  processor  is  seldom  applied  in  practical  array  processing 
problems.  In  practice,  when  the  arrival  directions  in  a  multipath 
channel  are  unknown,  a  set  of  closely  spaced  steering  vectors  is  used  to 
cover  the  sector  from  which  the  strongest  return  is  expected.  The 
steering  vectors,  multiplied  by  the  known  signal  s(t),  are  a  set  of 
spatial  filters  matched  against  deterministic  point  sources  immersed  in 
an  anisotropic  noise  field.  The  suboptimal  log-likelihood  ratio 


As(£)  =  £H  j^1  £(0)  s(t) 


(7.2-8) 


is  computed  for  each  steering  vector,  and  the  maximum  value  is  used  as 
the  test  statistic. 

Clearly,  it  is  easier  to  calculate  (7.2-8)  than  to  identify  _Ry  and 
implement  the  optimal  structure.  Is  optimal  processing  worth  the  added 
complexity?  In  order  to  answer  this  question,  a  meaningful  performance 
criteria  is  needed  by  which  the  structures  (7.2-7)  and  (7.2-8)  can  be 
compared  numerically.  This  will  be  addressed  in  the  next  section. 
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7.3  Derivation  of  the  Detection  Index 
7.3.1  Definition  and  Significance 

Ue  shall  use  the  detection  index  as  the  performance  criteria: 


E{  |  £  |  2  |  H^}  -  E{  |  £  I  2  |  Hq} 
E{ |£f2  |  Hq} 


(7.3-1) 


where  l  is  the  log-likelihood  functional.  This  is  a  meaningful  criteria 
for  two  reasons.  First,  the  detection  index  can  he  interpreted  as  the 
output  signal-to-noise  ratio,  which  means  that  the  processing  gain  of 
both  structures  can  be  calculated  and  compared.  Also,  it  can  be  shown 
that  A  relates  the  detection  probability  to  probability  of  false  alarm: 


(7.3-2) 


Clearly,  for  a  given  detection  probability,  A  should  be  as  large  as 
possible  to  minimize  the  probability  of  false  alarm.  Equation  (7.3-2) 
can  be  used  to  construct  receiver  operating  characteristics  for  the 
estimator-correlator  and  the  suboptimal  processor.  The  significance  of 
these  criteria  is  well  documented  [44]  [47], 


7.3.2  Derivation  for  the  Suboptimal  Processor 

First,  the  suboptimal  detection  index  will  be  calculated. 

Equation  (7.3-1)  can  be  evaluated  in  terms  of  the  channel  output 
£s  and  the  estimated  channel  output  _v .  The  latter  term  represents  the 


processing  signal  which  is  correlated  with  the  filtered  version  of  r(t). 


jif  w  w*  u^f  uw  \m  uyvtvi  v-w^nr  w  v^\rwvwvnf\rT\.r>»  v^r»^\rirvrwxrr\r»r\nT\r*\rwvwvwir»virvir 


The  likelihood  ratio  is 


*(r)  =  rH  Rn]  v 


(7.3-3) 


E(U|2  |  Hl}  =  E{|rH  R^1  £|  2}  =  E{  ( r"  r"1  £)*(rH  r"1  £)}  = 

E{0H  R'1  £  rH  R'1  (v)}  = 

R{  (y)H  R'1  (is)(fs)H  R-1  (y )}  +  E{  (y  )H  r"1  nn"  R"1  (v)}  (7.3-4) 
iH  —  N  —  —  — N  —  N  — 


The  first  tern  is: 


e<0H  *n1(*-)(£->H  RN!  (i)}  =  E{|(£s)H  r'1  ( v)  I  2}  (7.3-5) 


and  tiie  second  is: 


E{(£)H  R  1  nn"  (£)}  =  (y)H  R  1  (v) 


( 7.3-6) 


There  fore , 


K{\i\2  |  Hj}  =  E{|(£s)H  R,^1  (v)  |  2}  +  (v)H  R^,1  (v)  (7.3-7) 


Furthermore , 


Efhh  I  %}  =  E{  |rH  Rn10|2  |  H0)  =  E{(v)H  Rn*  nn"  r"1  (y)} 


H  „-l  __H  „-l 


V.v/0 

••.•••- 
v  v  y 

f  v  .■  v 


-V-' 

>v 

>>> 
■>  >  .V 
.  %.*■ 

>■>->- 


/.yy.v 


C-  •  •* 


*.*  V"  %’  N 

"  •.’■  v  V 
*  /  /  / 

•.  v  O  - 


;.vvv 

*  <“  O  V 

*>>&■ 


. 


<i>HV  <i> 


~WB 


>  ,>  V 

Cl 

;&#s3 


The  final  result  for  A  is: 


(7.3-8) 


E{  |(£s)H  R^(v)|2} 
(y)H  (v) 


This  expression  will  he  used  to  evaluate  suboptimal  processing 
performance . 

The  processing  signal  _v  used  in  (7.3-8)  is  the  scalar  signal  s(t) 
multiplied  by  a  steering  vector.  Denote  the  steering  vector  by  v(0). 
Then  Equation  (7.3-8)  becomes 


E{  |(£s)H  Rn1  v(0)|s(t)|2|2} 
vH(0)  R^1  v(6)  |s(t)|2 


(7.3-9) 


Evaluating  A  in  terms  of  the  measurement  model  (7.2-1)  and  the  correla¬ 
tion  matrix  _Rg  is  useful  for  computation.  Writing  the  numerator  of 
Equation  (7.3-9)  in  terms  of  the  measurement  model  gives: 


E{  |  (£s  )H  R^1  v(0  )  s(  t )  |  "}  =  F.{  j  [  l  b^  _v^  s(t)]H  R/  v(0  )  s(  t )  |  “} 

k=l 


Expanding  this  expression  yields 


E{  I  [  l  \  \  RnJ  v(0)|2  |  s ( t )  |  4] 


(7.3-10) 
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In  order  Co  express  Equation  (7.3-10)  in  terms  of  _Rp,  define  the 


Hermitian  form 


Ye  — k  — n1  ”(e) 


(7.3-11) 


E{lk^  bk  — k  — n'  ^(9)|Z}  =  E^J=1  bk  \0|Z}  = 


P  P 

E{  £  £  b  b*  v  v*  } 

L  L  k  £  k0  £9 


k=l  £=1 


(7.3-12) 


where,  for  simplicity,  we  have  dropped  the  oj  notation.  Rearranging 
Equation  (7.3-12)  gives 


P  P 


I  l  bp  Ye  ‘I 

k=l  k=l 


(7.3-13) 
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^  =  lvie  v29 


and  from  (7.3-11), 


Ye  =  Y  In1  ^(0) 


(7.3-14) 


(7.3-14) 


We  recall  that  _v^  is  the  ith  steering  vector.  It  is  easy  to  show  that 
the  denominator  of  Equation  (7.3-8)  is 
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lA 


I 


vH(0)  V  X<0)  |s(t)|2 


Combining  these  results  leads  to  the  result 


H  n 

i  -1.:  2 

A  =  — - r -  a 

v  (0)  v(8)  S 


(7.3-16) 


More  will  be  said  about  (7.3-16)  later  when  specific  situations  are 
considered . 


7.3.3  Derivation  for  the  Optimal  Processor 

A  closed  form  solution  for  the  optimal  structure  detection  index  is 
most  easily  derived  by  expressing  the  log-likelihood  ratio  in  its 
Hermitian  form  representation: 


l(r)  =  1  )jr 


(7.3-17) 


Since  _r  is  normally  distributed  under  either  hypothesis,  evaluating 
E{|z|2  |  Mg}  and  E{ | i | 2  |  H^}  is  equivalent  to  calculating  the  expecta¬ 
tion  of  the  product  of  random  Hermitian  forms.  The  solution  for  multi¬ 
variate  normal  distributions  is  well  known  and  available  in  the  statis¬ 
tical  literature  [48].  Using  these  results  leads  to  the  following: 


E{  K | 2  |  H1)  =  (trace  [(r"1  -  r"1  )  Rj ] ) 2 


+  2  trace  [(R^  -RjSrjCR^  -RjSiRj]  (7.3-18) 


The  arguments  of  the  trace  operator  can  be  simplified  since 
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(r"1  -  r'1)  R,  =  r"1  R.  -  I  =  R~ 1 ( R ,  -  Rm)  =  r"1  R  (7.3-19) 

— N  —1  —1  — N  —1  —  — N  —1  — N  — N  — y 

Substituting  (7,3-19)  into  (7.3-18)  gives 

E{ I l ! 2  !  Hl}  =  [crace  (R^1  Ry))2  +  2  trace  [(R^1  J^)2]  (7.3-20) 

Furthermore,  it  is  easy  to  show  that 

E{  |  £  |  2  |  H0)  =  [trace  (r"1  Ry))2  +  2  trace  [  (_R~ 1  Ry)2]  (7.3-21) 

Therefore,  the  detection  index  is  evaluated  by  substituting  these 
results  into  Equation  (7.3-1). 

7 . 4  Numerical  Simulation  Results 
7.4.1  Experimental  Description 

Equations  (7.3-16),  (7.3-20),  and  (7.3-21)  were  evaluated  numeri¬ 
cally  on  a  general  purpose  computer.  The  results  which  will  be  presen¬ 
ted  in  this  section  are  representative  of  many  scenarios  that  have  been 
s  tudied . 

The  array  consisted  of  eight  omnidirectional  sensors  spaced  equi- 
distantlv.  The  wavelength-to-spacing  ratio  was  0.5. 

The  signal  and  noise  sources  used  in  the  simulation  were  both  in 
the  far  field.  Wavefronts  which  represented  the  desired  signal  arrived 
at  the  array  from  -5° ,  0° ,  and  5°  due  to  the  multipath  propagation 
channel  £(•)•  The  array  look  direction  for  suboptimal  processing  was 
0°.  The  anisotropic  spread  noise  source  subtended  a  ten  degree  angle 


jf  lazAljCiJ 


(av-v; 


V  vv  v’ 
VVV 

''yC'-.-'-y 


cWa'J&L 


-’-y-rri 

•  k  *  ‘A 

•  vv.w. 


i.,i‘ .v 

-•  a  —  ■  a  a.  -  ^  JL.  K 


L.i.A.  A-i.  aV  x. 


and  was  centered  at  5°  with  respect  to  arrav  boresight.  Correlation 


coefficients  for  the  signal  wavefronts  were  given  by: 

*  ,  (0k  “  0£^  2lr(9k'0P), 

E{bk(«)bJl(U)}  =  exp{ - m - j  - ^50 - f  (7‘4"1) 

where  0^  is  the  arrival  direction  of  the  kth  wavefront  in  degrees.  This 

expression  was  used  to  calculate  the  entries  in  _Rg.  It  was  also  used  to 

generate  the  entries  in  the  noise  wavefront  correlation  matrix  Rjjjsj, 

implying  that  the  signal  and  noise  processes  have  similar  statistical 

2  2  2 

properties.  The  parameters  a  ,  ,  and  o^  were  adjusted  to  vary  the 

input  signal-to-noise  ratio. 

The  input  signal-to-noise  ratio  was  defined  as  the  average  signal 
power  summed  over  the  array  divided  by  the  average  noise  power  over  the 
array : 

SNR  =  10  log  (trace  _Ry/trace  _R^)  (7.4-2) 

7.4.2  Optimal  Versus  Suboptimal  Processing 

Figure  7-1  illustrates  processing  gain  as  a  function  of  input 
signal-to-noise  ratio.  These  results  show  that  optimal  processing 
affords  at  least  8  dB  improvement  over  suboptimal  processing. 

Receiver  operating  characteristics  for  input  signal-to-noise  ratios 
of  -20  dB,  -12  dB,  and  -9  dB  are  presented  in  Figures  7-2,  7-3,  and  7-4, 
respectively.  At  very  low  signal-to-noise  ratios  (less  than  -20  dB), 
the  performance  of  the  estimator-correlator  is  not  significantly  better 


»  y»/j.  A 


than  the  suboptimal  processor,  because  in  both  cases,  A  is  small,  and 
the  sum  1  +  A  is  approximately  one.  Therefore,  the  eight  dB  processing 
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gain  is  not  translated  into  an  improvement  in  detection  probability  for 
a  given  false  alarm  rate.  However,  once  the  SNR  is  greater  than  -20  dB, 
improvement  with  respect  to  this  criteria  can  be  clearly  seen. 
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7 . 5  Calculating  the  Estimator  Kernel 


7.5.1  Introduction 


The  second  half  of  this  study  actually  has  two  purposes:  first,  to 
demonstrate  that  the  generalized  singular  value  decomposition  can  be 
used  to  construct  G( • , • )  directly  from  data,  and  second,  to  determine 
what  signal-to-noise  ratio  is  needed  to  estimate  _Ry(*,»)  (or  to  form 
data  matrix  B).  This  is  an  important  parameter.  Of  course,  the  probing 
signal  is  under  our  control,  therefore  in  principle,  the  input  SNR  could 
be  raised  to  any  desired  magnitude.  However,  in  practical  situations 
this  is  clearly  not  possible;  therefore,  the  ability  to  estimate  _Ry( •,• ) 
and  £ ( • )  at  a  moderate  SNR  is  highly  desirable.  If  an  input  SNR  of  80 
dB  is  needed  for  identification,  one  can  safely  conclude  that  this 
approach  to  adaptive  implementation  is  impractical! 


7.5.2  Experimental  Description 

The  multipath  channel  model,  noise  models,  and  array  model  are 
identical  to  those  presented  in  Sections  7.2  and  7.4.1.  Both  _Ry  and  _R^ 
were  normalized  so  that  each  had  unit  trace,  meaning  that  the  nominal 
input  SNR  was  0  dB. 
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In  order  to  generate  simulated  array  measurements,  a  standard  IMSL 


routine  was  used  to  first  generate  sets  of  white  Gaussian  noise.  Next, 


the  white  noise  was  formed  into  random  vectors,  and  filtered  by  the 


square  roots  of  _Ry  and  in  order  to  construct  random  vectors.  After 


filtering,  the  random  vectors  represented  array  data  given  signal  alone 


and  noise  alone,  respectively.  Noisy  measurements  of  the  channel  output 


were  constructed  by  scaling  the  signal  alone  vectors  and  adding  them 


to  the  noise  alone  vectors. 


7.5.3  Results 


Several  generalized  singular  value  decomposition  algorithms  were 


made  available  to  the  author  courtesy  of  Charles  Van  Loan  of  Cornell 


University.  After  modifying  several  of  them  for  complex-valued 


matrices,  they  were  tested  in  two  ways.  First,  we  attempted  to  calcu¬ 


late  G  from  the  square  roots  of  the  actual  covariance  matrices  Rq  and 


_Ry.  To  do  this,  their  eigenvalue  decompositions  were  computed,  and 


matrices  A  and  B  were  defined  as  follows: 


.  _l/2  .1/2 

A  =  Rj  =  Aj  il| 


(7.5-1) 


B  -  Ri/2  -  A*'2  U 

-  -y  -y  -y 


(7.5-2) 


It  is  clear  from  Equations  (7.5-1)  and  (7.5-2)  that 


Ah  A  =  Ri 


(7.5-3) 


1H  1  =  Ay 


(7.5-4) 
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By  following  the  procedure  in  Section  6.3.3,  it  should  be  possible  to 
calculate  _G  from  (7.5-1)  and  (7.5-2). 

We  found  that  the  procedure  discussed  above  worked  perfectly.  The 
result  was  compared  with 


G  =  R  R?1 

-  -y  -1 


and  the  answers  matched  exactly.  We  concluded  that  the  algorithms 
worked  correctly,  and  that  it  should  be  possible  to  compute  G  from  data. 

Next,  we  attempted  to  calculate  G( • , • )  directly  from  array  data 
matrices.  Data  matrix  A  was  formed  from  256  array  measurements  given  a 
priori  knowledge  that  Hi  was  true.  The  data  matrix  B  was  formed  by 
increasing  the  probing  signal  power,  taking  256  array  measurements,  and 
then  scaling  the  matrix  by  a  factor  of  1/js.  This  step  normalized  the 


trace  of  _Rv  to  the  proper  value,  which  in  this  example  was  unity.  The 


dimensions  of  A  and  _B  were  256  by  8.  These  matrices  were  input  into  the 
generalized  singular  value  decomposition  algorithm,  and  the  processing 
steps  described  in  Section  6.3.3  were  followed  in  an  attempt  to  compute 
£(•«*). 

Unfortunately,  these  attempts  were  unsuccessful.  The  data  matiixjS 
was  constructed  at  signal-to-noise  ratios  ranging  from  eight  to  25  dB, 
and  in  all  cases,  the  algorithm  was  unstable.  We  do  not  know  why  the 
algorithms  did  not  work  during  these  trials;  however,  since  the  algor¬ 
ithms  are  very  new,  it  is  likely  that  the  computer  programs  have  not 
been  perfected. 
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Another  approach  to  the  problem  was  tried.  Since  the  algorithms 
worked  for  the  square  roots  of  and  _Ry,  we  computed  the  covariance 

matrix  estimates  R,  and  R  by 

—1  -y 


R  =  A  A 


Ry  =  _B  _B 


Of  course,  this  procedure  defeats  the  reason  why  the  CS  decomposition 
was  proposed  in  the  first  place,  but  we  wanted  to  demonstrate  in  prin¬ 
ciple  that  G( * , * )  could  be  constructed  from  data!  Next,  eigenvalue 
decompositions  of  R.^  and  _Ry  were  computed,  and  their  square  roots 
formed.  The  processing  procedure  in  Section  6.3.3  was  carried  out  using 
the  square  roots  of  the  estimated  covariance  matrices  in  another  attempt 
to  obtain  _£(•,•)  from  array  data. 

This  approach  was  successful,  and  furthermore,  accurate  estimates 
of  _Ry  could  be  made  at  moderate  signal-to-noise  ratios.  An  input  SNR  of 
15  dB  was  sufficient  for  excellent  identification.  This  judgement  was 
empirical,  because  we  compared  the  actual  _Ry  to  the  estimated  matrix 
element  by  element.  As  the  input  SNR  was  increased,  we  found  little 
improvement  after  an  SNR  of  15  dB  was  attained.  We  concluded  that  the 
identification  scheme  is  feasible,  and  can  be  carried  out  at  a  moderate 
SNR.  Although  it  is  not  possible  to  generalize  these  results  with 
absolute  certainty,  it  seems  reasonable  to  conclude  that  they  carry  over 
to  other  channel  models. 
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7.6  Relating  Identification  to  Processor  Performance 


The  results  from  Section  7.5  demonstrated  that  _Ry(*,*)  could  be 
identified  accurately  at  moderate  signal-to-noise  ratios.  However,  they 
did  not  made  a  connection  between  identification  and  optimal  processor 
performance.  Therefore,  further  experiments  were  conducted  in  order  to 
determine  how  well  _Ry ( * , * )  must  be  identified  to  obtain  processing  gain 
improvement  as  compared  to  suboptimal  methods. 

The  channel,  noise,  and  array  models  were  identical  to  those 
presented  in  Sections  7.2,  7.4.1,  and  7.5.2.  The  optimal  detection 
index  A  (Equation  (7.3-1))  was  evaluated  for  _Ry(*»')  identified  at  low 
and  high  signal-to-noise  ratios.  Closed-form  expressions  for  E{|z|2|H]} 
and  E{ | Z| 2 | Hq}  given  misidentified  Ry(*,*)  can  be  obtained;  however, 
thev  are  difficult  to  evaluate  analytically.  Therefore,  Monte  Carlo 
methods  were  used  to  evaluate  A. 

The  results  of  our  experiments  are  illustrated  in  Figure  7-5.  They 
are  interesting  and  intuitively  pleasing.  As  the  signal-to-noise  ratio 
increases,  the  processing  gain  approaches  the  theoretical  predicted 
maximum.  Little  processing  improvement  is  achieved  above  an  identifi¬ 
cation  SNR  of  15  dB,  which  makes  the  connection  between  receiver  per¬ 
formance  and  the  empirical  observation  made  in  the  previous  section. 
However,  the  most  significant  conclusion  of  this  experiment  is  that  even 
a  poor  identification  results  in  improved  detection.  Perfect  identifi¬ 
cation  at  high  signal-to-noise  ratios  is  not  needed. 


7.7  Conclusions 

Matrix  representations  can  be  used  to  model  non-trivial  propagation 
and  scattering  channels,  and  are  a  useful  tool  for  computer  simulation 
experiments.  In  this  chapter,  they  were  used  successfully  to  evaluate 
the  estimator-correlator  processor,  compare  its  performance  against  a 
suboptimal  receiver,  and  to  generate  simulated  array  measurements  for 
testing  new  computational  algorithms. 

The  relationship  between  identification  and  receiver  performance 
was  examined.  We  found  that  perfect  identification  was  not  required  to 
improve  processing  gain.  Even  a  poor  identification  conducted  at  a  low 
SNR  resulted  in  some  improvement. 

Also,  the  performance  of  optimal  versus  suboptimal  processing  was 
evaluated  numerically.  It  was  found  that  optimal  processing  gives  at 
least  eight  dB  improvement  over  suboptimal  techniques.  For  input 
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signal-to-noise  ratios  greater  than  -20  dB,  this  translated  into  an 
improved  receiver  operating  characteristic. 

The  performance  of  new  generalized  singular  value  decomposition 
algorithms  was  evaluated.  We  had  some  difficulties  with  them,  yet  were 
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ible  to  show  that  C( • , • )  could  be  computed  in  principle.  There  is 


nothing  incorrect  with  the  proposed  processing  scheme.  The  programs 
which  were  tested  are  new,  and  require  further  testing  and  debugging. 
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Chapter  8 


CONCLUDING  REMARKS 

8. 1  Conclusions 

The  estimator-correlator  processor  establishes  fundamental  connec¬ 
tions  among  detection  theory,  estimation  theory,  system  modeling,  and 
system  identification  theory.  Variations  of  this  canonical  structure 
solve  the  detection  problem  for  the  generalized  exponential  class  of 
signal  and  noise  distributions.  This  result  establishes  a  basic 
connection  between  detection  and  estimation  theory. 

An  operator  theoretic  approach  to  the  channel  representation 
problem  allows  the  detection  and  modeling  problems  to  be  solved  for  a 
very  wide  class  of  transmission  media,  especially  spread  channels,  which 
are  particularly  difficult  to  handle.  Moreover,  this  systematic 
approach  made  the  full  power  of  Hilbert  space  theory  and  functional 
analysis  available  for  use  in  subsequent  derivations. 

Matrix  representations  of  bounded,  linear  operators  are  useful  for 
modeling  a  wide  range  of  deterministic  and  stochastic  transformations 
one  might  encounter  in  practical  array  processing  problems.  They  are 
easily  incorporated  into  the  estimator-correlator  structure.  Identifi¬ 
cation  of  £(•)  as  represented  by  matrices  is  needed  in  order  to  calcu¬ 
late  the  conditional  mean  of  the  channel  output  £s(t).  Measuring  the 
matrix  element  cross-correlations  represents  a  systematic  approach  to 
stochastic  Green's  function  identification. 

The  channel  identification  problem  can  be  simplified  through 
simultaneous  diagonalizat ion  of  the  input  and  output  covariance  kernels. 


Simultaneous  diagonalization  can  be  accomplished  either  through  signal 
design  or  generalized  singular  value  decomposition.  The  result 
establishes  an  interesting  connection  among  detection,  estimation, 
system  modeling,  and  identification,  and  in  addition,  provides  new 
insight  into  classical  system  identification  issues.  The  Karhunen-Loeve 
expansion  furnishes  a  fundamental  structure  for  stochastic  system 
modeling  and  identification. 

Solving  the  space-time  processor  equations  through  orthogonal 
decompositions  represents  the  most  important  accomplishment  of  this 
dissertation.  Karhunen-Loeve  representations  are  the  key  to  both 
theoretical  analysis  and  adaptive  implementation.  Important  connections 
between  this  expansion  and  other  decompositions,  including  the  singular 
value  decomposition,  CS  decompositions,  generalized  eigenvalue  factori¬ 
zations,  QR  factorizations,  and  generalized  Fourier  series  have  been 
made.  Combinations  of  these  canonical  decompositions  and  orthonormal 
representations  provide  the  key  to  implementing  the  processor  with 
numerically  robust  algorithms. 

The  array  processing  algorithms  proposed  in  this  dissertation  are 
more  than  academic  ideas  that  can  not  work  in  practice.  They  were 
thoroughly  tested  and  they  work  well.  The  computational  burden  is  worth 
the  effort,  because  optimal  processing  is  significantly  better  than 
simpler  suboptimal  techniques.  Identification  can  be  carried  out  at 
moderate  signal-to-noise  ratios.  Moreover,  perfect  channel 
identification  is  unnecessary.  Even  a  poor  identification  improves 


receiver  performance. 
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8.2  Recommendations  for  Future  Research 

The  numerical  problems  involved  with  implementing  the  estimator- 


pkm 

wV-.V»  i 


correlator  must  be  examined.  It  is  not  known  how  finite-precision  word 
lengths,  matrix  ill-conditioning,  and  channel  misidenti f ication  work 


together  to  affect  the  performance  of  the  estimator-correlator. 
Understanding  these  effects  is  crucial  in  order  to  build  the  processor 
in  hardware. 
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Several  theoretical  issues  warrant  further  research.  For  example, 
some  recent  work  suggests  that  the  structure  of  _L( • )  gives  insight  into 
"how  far"  a  stochastic  system  deviates  from  stationarity .  This  idea 
needs  to  be  developed,  since  it  has  the  potential  to  give  new  insights 
into  stochastic  system  characterization. 

In  this  dissertation,  the  theoretical  foundations  of  optimal  space- 
time  array  processing  have  been  examined  at  length.  We  demonstrated  how 
the  processing  equations  can  be  solved  and  implemented  with  robust 
numerical  algorithms.  It  is  clear  from  Chapter  6  that  implementing  the 
processor  is  very  demanding  computationally.  However,  the  data  flow  and 
computations  are  regular,  repetitive,  and  well  suited  for  parallel 
computations  by  distributed  processors.  Algorithms  for  performing 
singular  value  decompositions  and  CS  decompositions  that  are  amenable  to 
parallel  processing  or  systolic  array  implementation  need  to  be 
developed.  Solving  these  problems  will  require  new  basic  computational 
cells  and  new  methods  to  assess  their  computational  complexity. 

Developing  new  basic  computational  cells  will  require  a  deeper 
understanding  of  the  fundamental  structure  of  computational  algorithms. 


'a.v.-.v/tv, 

v.v.vM 


v  V  V  V* 

-  V>  y,V> 

■vvvv rS 


■a 


•  v  v  ■>  v  > 

*.v.v.v.v-.y3 

-.V.-.-.V.'-.V, 


1.  «  I  '  .  *  I  *  ,  *  ,  >  .  •  .  *  .  *  .  1  ."V  |N  _%  A  A  A  V*\  .v  '»  «*•  •*  N*  *•'  v  V 


•v.v; 


2  44 


For  example,  a  basic  algorithm  used  in  many  signal  processing  operations 
is  the  generalized  coordinate  rotation  [49].  In  many  applications,  it 
can  he  regarded  as  more  fundamental  than  even  the  traditional  floating 
point  operation  (complex  multiply-and-accumulate ) .  Surprisingly  enough, 
the  generalized  coordinate  rotation  algorithm  has  a  fundamental  connec¬ 
tion  with  He  group  theory,  an  abstract  mathematical  discipline  [50]. 
More  work  in  this  area  is  needed  to  establish  deeper  connections  between 
Lie  group  theory  and  basic  signal  processing  algorithms. 

The  connection  between  the  work  presented  in  this  dissertation  and 
other  basic  research  areas  in  adaptive  signal  processing  is  presented  in 
Figure  8-1.  The  overall  effort  calls  on  disciplines  such  as  Lie  group 
theory,  graph  theory,  and  information-theoretic  analysis  of  computa¬ 
tional  complexity,  in  addition  to  stochastic  operator  theory,  numerical 
analysis,  and  integral  equation  theory.  All  of  these  tools  contribute 
to  the  understanding  needed  for  efficient  optimal  array  processor 
design. 
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APPENDIX 


CONTOUR  INTEGRAL  EVALUATION 
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Calculating  the  matrix  representations  for  the  operators  appearing 
in  Sections  3.2.3,  3.2.5,  and  3.2.6  requires  evaluating  three  definite 


integrals . 


The  representation  for  the  time  delay  operator  is: 


1  r  sin  p(t  -  t  -  nT)  sin  o(t  -  nT) 


x.  ,  ,  s  _  i  r  sm  ovc  ~  T 

"<AtW't  f  ~  'al  ~T  - 


mT )  a( t -  nT) 


dt  ( A— 1 ) 


To  evaluate  Equation  (A-l),  begin  by  simplifying  the  product  in  the 
numerator  using  the  trigonometric  identity 


sin  a  sin  8  =  cos(a  -  3)  ~  cos(a  +  8) 


By  setting  a  =  at  +  maT  and  b  =  naT  then 


sin  a(t  -  t  -  mT)sin  a(t  -  nT)  =  sin(aT  -  a)sin(at  -  b)  = 
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—  cos(2ot  -  (a  +  b)) 


and  (A-l)  becomes 


V\-\- V- 


1  r  cos ( b  -  a)  1  c  cos(2ot  -  (a  +  b))  „ , 

“mn  If  J  CaT- 1  )Tot~-  ~)  dt  ~~  J  Ca-2) 
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Evaluate  the  first  integral  in  (A-2): 


+<© 


—  / 

O'J*  j 


cos(b  -a) 
(at  -  a ) ( at  - 


b) 


dt  = 


cos(b  -  a) 

2Ta“ 


/ 


dt 


(t  -  a/o ) ( t  -  b/a) 


( A- 3 ) 


Convert  (A-3)  into  a  contour  integral  on  the  z-plane: 

cos(h  -  a) 


f  - 


2Ta‘ 


dz _ 

(z  -  a/a)(z  -  b/a ) 


(A-4) 


where  C  is  shown  in  Figure  A-l.  Reference  [ 5 1 1  shows  that 


dz 


(z  -  a/a)(z  -  b/a) 


=  /  f(z)  dz  =  iri  £  Res  f(z) 


with  Res  f(z)  meaning  the  residue  of  f(z).  The  residue  at  x  =  a/a  is 


1 


a/a  -  h/a  (a  -  b) 


The  residue  at  x  =  b/a  is 


1 


b/a  -  a/a  ( b  -  a ) 


There  fore , 


dz 


(z  -  a/a)(z  -  b/a) 


=  0 


The  second  integral  shall  be  calculated  in  a  similar  manner: 


cos ( 2at  -  (a  +  b) ) 
(at  -  a)(at  -  b) 


dt  =  Re  {  j 


exp(i(2ot  -  (a  +  b)) 
(at  -  a ) ( at  -  b) 


dt 


( A-5) 
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Since 


a  -  b  =  ctt  -  maT  -  naT  =  a(t  -  (n  -  m)T) 


the  coefficients  are 


sino( i  -  (n  -  m)T) 
o(t  -  (n  -  m)T) 


( A-8 ) 


which  is  the  answer. 

The  matrix  representation  of  the  stretching/compression  operator 


is  given  by: 


JL  f  s:i-n  cr(at  -  mT)  sin  o(  t  -  nT) 

T  '  a( at  -  mT  a(t  -  nT)  ^ 


(A-9 ) 


for  a  >  0. 

The  calculation  is  straightforward  and  similar  to  the  first  example. 
The  answer  is: 


sin(oT(an  -  m)/a) 


nm  iffUan  -  m)/ 


( A-l 0 ) 


To  check  the  answer,  if  a  =  1,  then 


anm  “  ^  41  n »  ^  _  ^mi 


which  is  the  identity  operator, 
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